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ABSTRACT 


Title  of  Dissertation:  Control  of  Smart  Actuators 


Xiaobo  Tan,  Doctor  of  Philosophy,  2002 


Dissertation  directed  by:  Professor  John  S.  Baras 

Professor  P.  S.  Krishnaprasad 

Department  of  Electrical  and  Computer  Engineering 


Hysteresis  in  smart  materials  hinders  wider  applicability  of  such  materials  in 
actuators  and  sensors.  In  this  dissertation  we  study  modeling,  identification  and 
control  of  hysteresis  in  smart-  actuators.  While  the  approaches  are  applicable  to 
control  of  a  wide  class  of  smart  actuators,  we  illustrate  the  ideas  through  the 
example  of  controlling  a  magnetostrictive  actuator. 

Hysteresis  exhibited  by  magnetostrictive  actuators  is  rate-independent  when 
the  input  frequency  is  low  and  we  can  model  it  by  a  Preisach  operator.  It  becomes 
rate-dependent  when  the  input  frequency  gets  high  due  to  the  eddy  current  effect 
and  the  magnetoelastic  dynamics.  In  this  case,  we  propose  a  new  dynamic  hys¬ 
teresis  model,  consisting  of  a  Preisach  operator  coupled  to  an  ordinary  differential 
equation  in  an  unusual  way.  We  establish  its  well-posedness  and  study  its  various 
system-theoretic  properties.  Existence  of  periodic  solutions  under  periodic  forcing 


is  proved.  Algorithms  for  simulation  of  the  model  are  also  studied.  Parameter 
identification  methods  for  both  the  Preisach  operator  and  the  dynamic  model  are 
investigated. 

We  pursue  the  problem  of  hysteresis  control  along  three  different  but  connected 
paths:  inverse  control,  robust  control  and  optimal  control. 

The  idea  of  inverse  control  is  to  construct  an  inverse  operator  to  cancel  out 
the  hysteretic  nonlinearity.  Efficient  inversion  schemes  are  proposed  for  both  the 
Preisach  model  and  the  dynamic,  hysteresis  model.  We  also  formulate  and  study 
a  novel  inversion  problem,  called  the  value  inversion  problem,  and  apply  it  to 
micro-positioning  control. 

Inverse  compensation  is  open-loop  in  nature  and  therefore  susceptible  to  model 
uncertainties  and  to  errors  introduced  in  the  inverse  schemes.  We  propose  a  robust 
control  framework  for  smart-  actuators  by  combining  inverse  compensation  with 
robust  control  techniques.  We  present  systematic  controller  design  methods  which 
guarantee  robust  stability  and  robust  trajectory  tracking  while  taking  actuator 
saturation  into  account. 

Finally  we  study  optimal  control  of  hysteresis  in  smart  actuators  based  on  a  low 
dimensional  hysteresis  model.  We  characterize  the  value  function  as  the  (unique) 
viscosity  solution  to  a  Hamilton-Jacobi-Bellman  equation  of  a  hybrid  form,  and 
provide  a  numerical  scheme  to  approximate  the  solution. 
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Chapter  1 


Introduction 


Smart  materials,  such  as  magnetostrictives,  piezoelectrics,  electroactive  polymers 


(EAPs),  shape  memory  alloys  (SMAs),  elect rorheological  (ER)  fluids  and  magne- 
torheological  (MR)  fluids,  all  display  certain  coupling  phenomena  between  applied 
electromagnetic /thermal  fields  and  t.lieir  mechanical / rheological  properties.  Actu¬ 


ators  and  sensors  made  of  these  materials  can  be  built  into  structures,  often  called 


smart  structures,  with  the  ability  to  sense  and  respond  to  environmental  changes 
to  achieve  desired  goals.  Smart  materials  and  smart  structures  have  been  receiving 


tremendous  interest  in  the  past  decade,  due  to  their  broad  applications  in  areas 
of  aerospace,  manufacturing,  defense,  and  civil  infrastructure  systems,  to  name  a 


few.  Hysteresis  widely  existing  in  smart  materials,  however,  makes  the  effective 


use  of  smart  actuators  and  sensors  quite  challenging. 


A  fundamental  idea  in  coping  with  hysteresis  is  to  formulate  the  mathematical 
model  of  hysteresis  and  use  inverse  compensation  to  cancel  out  the  hysteretic 


effect.  This  idea  can  be  found  in  [45,  80,  71,  35,  79,  62],  There  have  been  a  few 


monographs  devoted  to  modeling  of  hysteresis  and  study  of  dynamical  systems 
with  hysteresis  [55,  58,  86,  20,  81], 


1 


Hysteresis  models  can  be  roughly  classified  into  physics-based  models  and  phe¬ 
nomenological  models.  An  example  of  a  physics-based  model  is  the  Jiles- Atherton 
model  of  ferromagnetic  hysteresis  [51],  where  hysteresis  is  considered  to  arise  from 
pinning  of  domain  walls  on  defect  sites.  The  most  popular  phenomenological  hys¬ 
teresis  model  used  in  control  of  smart  actuators  has  been  the  Preisach  model 
[1,  45,  46,  36,  38,  79,  62],  A  similar  type  of  operator,  called  Krasnosel’skii- 
Pokrovskii  (KP)  operator  has  also  been  used  [7,  35].  Although  in  general  the 
Preisach  model  does  not  provide  physical  insight  into  the  problem,  it  provides  a 
means  of  developing  phenomenological  models  that  are  capable  of  producing  be¬ 
haviors  similar  to  those  of  physical  systems  (see  Mayergoyz  [58]  for  an  excellent 
exposition). 

In  this  dissertation  we  study  control  methodologies  for  smart  actuators  ex¬ 
hibiting  hysteresis.  We  illustrate  the  ideas  through  the  example  of  controlling  a 
commercially  available  magnetostrictive  actuator.  Magnetostriction  is  the  phe¬ 
nomenon  of  strong  coupling  between  magnetic  properties  and  mechanical  prop¬ 
erties  of  some  ferromagnetic  materials  (e.g.,  Terfenol-D):  strains  are  generated  in 
response  to  an  applied  magnetic  held,  while  conversely,  mechanical  stresses  in  the 
materials  produce  measurable  changes  in  magnetization.  This  phenomenon  can  be 
used  for  actuation  and  sensing.  Magnetostrictive  actuators  have  applications  to 
micro-positioning,  robotics,  ultrasonics,  vibration  control,  etc.  Figure  1.1  shows 
a  sectional  view  of  a  Terfenol-D  actuator  manufactured  by  Etrema  Products,  Inc. 
By  varying  the  current  in  the  coil,  we  vary  the  magnetic  held  in  the  Terfenol-D 
rod  and  thus  control  the  motion  of  the  rod  head.  Figure  1.2  displays  the  hysteresis 
observed  in  the  magnetostrictive  actuator. 

We  study  the  problem  of  control  of  hysteresis  from  two  perspectives.  The  first 
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Figure  1.1:  Sectional  view  of  a  Terfenol-D  actuator  [82]  (Original  source:  Etrema 
Products,  Inc.). 

one  is  based  on  the  Preisach  model  and  the  theme  is  to  develop  accurate  and  fast 
inverse  control  algorithms.  The  second  perspective  is  optimal  control  based  on  the 
low  dimensional  bulk  ferromagnetic  hysteresis  model  [82,  84],  a  modification  of  the 
Jiles- Atherton  model.  We  now  outline  the  contributions  of  this  dissertation. 

1.1  Contributions  of  the  Dissertation 

We  note  that  although  the  dissertation  is  based  on  controlling  a  magnetostrictive 
actuator,  our  work  is  applicable  to  control  of  a  wide  class  of  smart  actuators  for 
two  reasons:  1)  the  Preisach  operator  is  able  to  model  hysteresis  in  various  smart 
actuators;  2)  a  low  dimensional  ferroelectric  hysteresis  model  has  been  proposed 
[72]  and  therefore  the  viscosity  solutions  approach  in  Chapter  5  applies  well  to 
optimal  control  of  actuators  made  of  ferroelectric  materials,  e.g.,  piezoelectrics 
and  electrostrictives. 
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Figure  1.2:  Hysteresis  in  the  magnetostrictive  actuator. 

1.1.1  Modeling  and  control  of  hysteresis  based  on  the  Preisach 
operator 

When  the  input  frequency  is  very  low  (typically  below  5  Hz),  the  magnetostrictive 
hysteresis  is  rate-independent  and  can  Ire  modeled  by  a  Preisach  operator  alone. 

We  propose  a  constrained  least  squares  algorithm  to  obtain  a  discrete  approxima¬ 
tion  to  the  Preisach  measure,  and  present  several  algorithms  to  invert  the  Preisach 
operator  efficiently. 

By  inverse  compensation,  one  usually  refers  to  the  trajectory  inversion.  In  many 
applications,  such  as  micro-positioning,  we  are  more  interested  in  the  following 
problem:  given  a  desired  output  value ,  find  an  input  trajectory  such  that  the 
final  value  of  the  output  matches  the  desired  value.  To  distinguish  this  problem 
from  the  trajectory  inversion  problem,  we  call  it  the  value  inversion  problem.  The 
discretized  Preisach  operator  is  a  finite  state  machine  (FSM).  We  formulate  the 
value  inversion  problem  as  a  state  reachability  problem  for  the  FSM.  We  show 
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that  the  FSM  is  reachable  and  propose  a  state  space  reduction  scheme,  which 
significantly  saves  storage  space  and  computation  time. 

When  the  input  frequency  gets  high,  the  magnetostrictive  hysteresis  is  rate- 
dependent  due  to  the  eddy  current  effect  and  the  maguetoelastic  dynamics  of 
the  actuator  rod.  We  propose  a  novel  dynamic  hysteresis  model,  consisting  of  a 
Preisach  operator  coupled  to  an  ordinary  differential  equation  (ODE)  in  an  un¬ 
usual  way  We  establish  the  well-posedness  of  the  model  and  study  its  various 
system-theoretic  properties.  Existence  of  periodic  solutions  under  periodic  forc¬ 
ing  is  proved.  Algorithms  for  simulation  of  the  model  are  also  studied.  Methods 
for  parameter  identification  and  inverse  compensation  for  this  dynamic  model  are 
proposed. 

Inverse  compensation  is  open-loop  in  nature  and  therefore  susceptible  to  model 
uncertainties  and  to  errors  introduced  in  the  inverse  schemes.  We  propose  a  robust 
control  framework  for  smart  actuators  by  combining  inverse  compensation  with 
robust  control  techniques.  We  present  systematic  controller  design  methods  which 
guarantee  robust  stability  and  robust  trajectory  tracking  while  taking  actuator 
saturation  into  account. 

Ideas  and  theories  are  backed  by  extensive  simulation  and  experimental  results. 

1.1.2  Optimal  control  of  hysteresis  based  oil  the  low  di¬ 
mensional  model 

Optimal  control  of  the  magnetostrictive  actuator  is  investigated  based  on  the  low 
dimensional  ferromagnetic  hysteresis  model  proposed  by  Venkataraman  and  Kr- 
ishnaprasad  [84,  82].  We  study  an  infinte  time  horizon  optimal  control  problem  in 
details.  The  value  function  is  characterized  as  the  (unique)  viscosity  solution  to 
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a  Hamilton- Jacobi-Bellman  equation  (HJB)  of  a  hybrid  form.  We  also  provide  a 
numerical  scheme  to  approximate  the  solution. 

The  viscosity  solutions  approach  is  also  extended  to  other  control  problems  of 
practical  interest,  e.g.,  the  finite  time  horizon  problem,  the  time-optimal  control 
problem,  the  exit  problem,  and  the  nonlinear  TL0 c.  control  problem. 

1.2  Organization  of  the  Dissertation 

In  Chapter  2  we  provide  an  introduction  to  the  Preisach  operator,  and  present  iden¬ 
tification  and  inversion  schemes  for  the  Preisach  operator.  The  dynamic  hysteresis 
model  is  proposed  and  studied  in  Chapter  3.  In  Chapter  4  we  discuss  the  robust 
control  framework  for  smart  actuators.  In  Chapter  5,  we  present  the  viscosity 
solutions  approach  for  optimal  control  of  hysteresis  based  on  the  low  dimensional 
model.  Conclusions  and  future  work  are  provided  in  Chapter  6. 
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Chapter  2 


Identification  and  Approximate 
Inversion  of  the  Preisach 
Operator 


When  the  input  frequency  is  low  (typically  below  5  Hz),  the  magnetostrictive 
hysteresis  is  rate-independent  and  can  be  modeled  by  a  Preisach  operator  alone.  In 
this  chapter  we  first  give  an  introduction  to  the  Preisach  operator.  Then  we  discuss 
how  to  identify  the  Preisach  measure.  Finally  we  study  two  types  of  inversion 
problems  for  the  Preisach  operator:  the  trajectory  inversion  problem  and  the  value 
inversion  problem. 

2.1  Introduction  to  the  Preisach  Operator 

In  this  section  we  introduce  the  Preisach  operator  and  some  of  its  properties. 
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2.1.1  The  Preisach  operator  in  (/3.a)  coordinates 

For  a  pair  of  thresholds  (0,  n)  with  0  <  a,  consider  a  simple  hysteretic  element 

•],  as  illustrated  in  Figure  2.1.  For  u  £  C^fO,!1])  and  an  initial  configuration 

(  £  {  —  1,1},  the  function 

v  =  7A«[«,C]:[0,n->{-l»l} 

is  defined  as  follows  [86]: 

—  1  if  u(0)  <  0 
(  if  0  <  u( 0)  <  a  > 

1  if  u(0)  >  a- 

and  for  t,  £  (0,7’],  setting  Ah  =  {r  £  (0 ,t]  :  u(r)  =  0  or  ct}, 

v(0)  if  Ah  0 

v(t)  =  <  —1  if  Ah  /  0  and  u(rnax  Ah)  =  0  ■ 

1  if  Ah  0  0  and  idrnax  Xt)  =  a 

This  operator  is  sometimes  referred  to  as  an  elementary  Preisach  hysteron  (we 
will  call  it  a  hysteron  in  this  dissertation),  since  it  is  a  building  block  for  the 
Preisach  operator. 

u 


Figure  2.1:  The  elementary  Preisach  hysteron. 


The  Preisach  operator  is  a  weighted  superposition  of  all  possible  hysterons. 
Define  Pq  =  {((3, a)  £  R2  :  (3  <  a}.  P0  is  called  the  Preisach  plane,  and  each 
(In)  G  Pq  is  identified  with  the  hysteron  7,3, a •  For  u  G  C([0,T])  and  a  Borel 
measurable  initial  configuration  Co  of  all  hysterons: 

Co:  A -{-U}. 

the  output  of  the  Preisach  operator  T  is  defined  as  [86]: 

y(t)  =  r[u,Co](i)  =  [  %,a[u,(o(P,®)}(t)dv(p,a),  (2.1) 

JPo 

where  u  is  a  finite,  signed  Borel  measure  on  Pq,  called  the  Preisach  measure. 
Appendix  B  provides  an  introduction  to  the  measure  theory. 

In  this  dissertation,  we  call  the  Preisach  measure  v  nonsingular  if  \u\  is  ab¬ 
solutely  continuous  with  respect  to  the  two-dimensional  Lebesgue  measure,  and 
singular  otherwise.  By  the  Radon-Nikodym  theorem,  if  u  is  nonsingular,  there 
exists  a  Borel  measurable  function  //,  such  that 

r  [it,  Co]  (t)  =  jj  //(  bo)7ln[//.Cn(lo)](/!t/bt/o.  (2.2) 

The  weighting  function  //  is  often  referred  to  as  the  Preisach  function  [58]  or  the 
density  function  [20] . 

To  simplify  the  discussion,  throughout  the  dissertation  we  assume  that  p  has 
a  compact  support,  i.e.,  p((3,a)  —  0  if  (3  <  /3o  or  a  >  ao  for  some  f3o,ao.  In  this 
case  it  suffices  to  consider  the  finite  triangular  area 

p  =  {(.j.  a)  G  R’"|a  >  (3,  (3  >  (30,  a  <  a0},  (2.3) 

as  shown  in  Figure  2.2(a).  Without  loss  of  generality,  we  further  assume  that 
a 0  =  Po  =■  r0  >  0. 
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The  memory  effect  of  the  Preisach  operator  can  be  captured  by  curves  in  P. 
At  each  time  instant  t,  define 

P-(t)  =  {(/ 3, a )  E  P |  output  of  at  t  is  —  1}, 

P+(t)  =  {((3,a)  £  P |  output  of  7/3, a  at  t  is  +  1}, 

so  that  P  =  P~(t)  U  P+(t),  V t.  Eq.  (2.2)  can  be  rewritten  as: 

//(/)  —  /  /  n(l3, a)d0da  —  I  I  //(  i. o )d(3da.  (2-4) 

J  Jp+(t)  ’  J  Jp-(t) 

Now  assume  that  at  some  initial  time  to,  the  input  u{to)  =  Uq  <  /% ■  Then 
the  output  of  every  hysteron  is  —1.  Therefore  P-(to)  =  P ,  P+(to)  =  0  and  it 
corresponds  to  the  “negative  saturation”  (Figure  2.2(b)).  Next  we  assume  that  the 
input  is  monotonieally  increased  to  some  maximum  value  at  t\  with  u(ti)  =  u\  . 
The  output  of  7/3, Q:  is  switched  to  +1  as  the  input  u(t)  increases  past  a.  Thus  at 
time  ti,  the  boundary  between  P-{t\)  and  P+(t i)  is  the  horizontal  line  a  =  U\ 
(Figure  2.2(c)).  Next  we  assume  that  the  input  starts  to  decrease  monotonieally 
until  it  stops  at  to  with  u(to)  =  112-  It’s  easy  to  see  that  the  output  of  7/3, a  becomes 
—  1  as  u(t)  sweeps  past  /i,  and  correspondingly,  a  vertical  line  segment  0  =  u2  is 
generated  as  part  of  the  boundary  (Figure  2.2(d)).  Further  input  reversals  generate 
additional  horizontal  or  vertical  boundary  segments. 

From  the  above  illustration,  we  can  see  that  each  of  P_  and  P+  is  a  connected 
set,  and  the  output  of  the  Preisach  operator  is  determined  by  the  boundary  between 
P-  and  P+.  The  boundary  is  called  the  memory  curve.  The  memory  curve  has  a 
staircase  structure  and  its  intersection  with  the  line  a  =  6  gives  the  current  input 
value.  The  memory  curve  ipo  at  t,  —  0  is  called  the  initial  memory  curve  and  it 
represents  the  initial  condition  of  the  Preisach  operator. 

If  the  Preisach  measure  is  nonsingular,  we  can  identify  a  configuration  of  hys- 
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Figure  2.2:  Memory  curves  in  the  Preisach  plane. 

terons  (,/,  with  a  memory  curve  ip  in  the  following  way:  Q,(P,a)  =  1  (—  l,resp.)  if 
(P,  a)  is  below  (above,  resp.)  the  graph  of  ip.  Note  that  it  does  not  matter  whether 
(,/,  takes  1  or  —  1  on  the  graph  of  ip. 

In  the  sequel  we  will  put  the  initial  memory  curve  tpo  as  the  second  argument 
of  P,  where 

r(vi/>o]  =  r[-, 


2.1.2  The  Preisach  operator  in  (r,  s)  coordinates 

Sometimes  it  is  more  convenient  to  describe  the  Preisach  operator  using  the  (r,  s) 
coordinates  with  r  =  and  s  =  If  the  Preisach  measure  is  nonsingular, 

the  output  of  the  Preisach  operator  can  be  expressed  in  terms  of  (r,  s )  as: 

COC  poo 

y(t)  =  r[u,  ipo](t)  =  /  u(r,  s)7,_r,a+r[«,  Cto(s  —  r,s  +  r)](t)dsdr ,  (2.5) 

J  0  J—  oc 


n 


where  o>(-,  •)  is  the  density  function  expressed  in  the  (r,  s)  coordinates.  In  the  new 
coordinates,  a  memory  curve  if[t]  at  time  t  is  the  graph  of  a  function  of  r,  and 
if[t]( 0)  gives  the  current  input  value  u(t)  (Figure  2.3).  Eq.  (2.5)  can  be  rewritten 

£lSI 

/»oc  AOC 

y(t)  =  T[u,ifo](t)  =  u0  —  2  /  /  u(r,s)dsdr,  (2.6) 

Jo  Ji/>[t](r) 

where  vq  is  the  output  corresponding  to  the  positive  saturation. 


.S' 


Figure  2.3:  The  Preisach  plane  in  (r,  s)  coordinates. 

Although  practically  a  memory  curve  is  only  composed  of  segments  of  slope  ±1 
in  (r,  s)  coordinates,  we  make  the  following  definition: 

Definition  2.1.1  [20,  37]  The  set  of  memory  curves  T  is  defined  to  be  the  set  of 
continuous  functions  if  :  [0,ro]  — >  R  such  that 

1.  h/’(r;i)  -  ip(r2)\  <  \n  -  r2|,  Vri,r2  €  [0,ro], 

2.  r(/'0)  =  0, 

where.  r0  is  the  constant  defined  in  Subsection  2.1.1. 

The  graph  of  any  if  £  T  is  confined  in  the  triangular  region  Pr,  as  shown  in 
Figure  2.4. 
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Figure  2.4:  The  set  T  of  memory  curves. 


Remark  2.1.2  Including  in  T  all  functions  with  Lipschitz  constant  1  leads  to  a 
complete  metric  space  [37],  which  will  facilitate  analysis  in  the  sequel.  In  addition 
this  will  allow  one  to  include  certain  initial  hysteron  configurations  carrying  phys¬ 
ical  interpretations,  e.g..  'ijj(r)  =  0,  Vr  €  [0,  r0],  can  represent  the  demagnetized 
virgin  state  in  ferromagnetics  [58,  86]. 

We  will  switch  between  the  (  To)  coordinates  and  the  (r, s)  coordinates  in  this 
dissertation. 


2.1.3  Properties  of  the  Preisach  operator 

The  Preisach  operator  has  a  number  of  important  properties  [58,  86,  20].  The 
following  theorems  summarize  some  properties  which  will  be  useful  for  development 
of  results  in  this  dissertation. 

Theorem  2.1.3  [86]  Let  v  be  a  Preisach  measure.  Let  u.  u\.  //•_>  €  <7([0,T])  and 
b’o  €  T.  Then  the  following  hold: 

1.  (Rate-independence)  If  d>  :  [0,T]  — ►  [0,  T]  is  an  increasing  continuous 
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function  satisfying  6(0)  =  0  and  6(T)  =  T ,  then 


v  n  <><>.  Ct)  (t i  =  rfo.  i.\,\(< >{t)).  vt  e  [o ,t], 

where  “o”  denotes  composition  of  functions. 

2.  (Strong  continuity)  Ifu  is  nonsingular,  then  T[-,  ip0]  ■  C([0,T])  — >  (7([0,  T]) 
is  strongly  continuous  fin  the  sup  norm). 

3.  (Piecewise  monotonicity)  Assume  u  >  0.  If  u  is  either  nondecreasing  or 
nonincreasing  on  some  interval  in  [0,  T],  then  so  is  r[u,^0]- 

4 ■  (Order  preservation)  Assume  u  >  0.  If  Ui  <  u-2  on  [0,!1],  then 

I>i,P0]  <  r[«2,^o] 

on  [0,  T\. 

Theorem  2.1.4  (Lipschitz  continuity)  [20]  1  Assume  that  the  Preisach  mea¬ 
sure  v  is  nonsingular.  Let  u  be  the  Preisach  density  function  in  ( r,s )  coordinates. 
Then  for  any  y>0  E  T,  r[v0o]  is  Lipschitz  continuous  on  C([ 0,  T])  with  Lipschitz 
constant  2Ci  if 

/*OC 

Ci  =  /  sup  \ui(r,  s)\dr  <  oo.  (2.7) 

Jo  seR 


2.2  Identification  of  the  Preisach  Measure 

2.2.1  Review  of  measure  identification  methods 

For  the  Preisach  operator,  the  only  parameter  is  the  Preisach  measure.  A  classical 
method  for  identifying  the  Preisach  density  function  is  using  the  so  called  first 
1See  also  [86]  for  a  slightly  different  version. 
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order  reversal  curves ,  detailed  in  Mayergoyz  [58].  A  first  order  reversal  curve  can 
be  generated  by  first  bringing  the  input  to  (30,  followed  by  a  monotonic  increase 
to  a,  then  a  monotonic  decrease  to  (3.  The  term  “first  order  reversal”  comes  from 


that  each  of  these  curves  is  formed  after  the  first  reversal  of  the  input.  Denote  the 


output  value  as  f((3,  a)  when  the  input  reaches  j'3.  Then  the  density  fi{(3,  a)  can 


be  obtained  as 

1  d2f(p,  a) 

2  dftda 


(2.8) 


Since  it  involves  twice  differentiation,  a  smooth  approximating  surface  is  fit  to 
the  data  points  in  practice  [45,  46,  38].  Hughes  and  Wen  [45,  46]  approximated 


the  surface  by  polynomials  using  a  least  squares  method.  Gorbet,  Wang  and 


Morris  employed  functions  with  specific  forms,  and  the  parameters  were  obtained 
via  a  weighted  least  squares  algorithm  [38].  A  fuzzy  approximator  was  adopted 
to  approximate  the  surface  in  [62].  As  pointed  out  in  [38],  deriving  the  density 
by  differentiating  a  fitted  surface  is  inherently  imprecise,  since  different  types  of 
approximating  functions  lead  to  quite  different  density  distributions. 

Hoffmann  and  Sprekels  [53]  proposed  a  scheme  to  identify  the  Preisach  measure 
directly.  By  devising  the  input  sequence  carefully,  they  set  up  independent  blocks 


of  linear  equations  involving  the  output  measurements  and  the  weighting  masses 


in  the  discretized  Preisach  plane,  with  the  number  of  measurements  equal  to  that 


of  unknowns.  Each  block  of  equations  can  be  solved  successively  to  obtain  the 
weighting  masses.  This  scheme  is  very  sensitive  to  experimental  errors  as  one  can 
easily  see.  Using  the  identified  weighting  masses  [53],  Hoffman  and  Meyer  [52] 
approximated  the  density  function  in  terms  of  a  set  of  basis  functions.  A  least 
squares  method  was  applied  to  compute  the  coefficients. 

Another  method  for  measure  identification  is  driving  the  system  with  a  “rea- 
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sonably”  rich  input  signal,  measuring  the  output  and  then  estimating  the  density 
by  a  least  squares  method.  This  idea  appeared  in  the  work  of  Banks  and  his 
colleagues  [7,  8],  where  they  investigated  the  identification  problem  of  the  KP  op¬ 
erator.  Galinaitis  and  Rogers  [35]  used  the  same  idea  to  identify  the  weights  for  a 
discretized  KP  operator.  We  also  adopt  the  least  squares  method  for  identification 
of  the  Preisach  measure  [79]. 


2.2.2  An  identification  scheme 

Smart  actuators,  due  to  the  capacity  of  the  windings  or  other  practical  reasons, 
have  to  be  operated  with  their  inputs  within  specific  ranges.  As  a  consequence,  we 
will  not  be  able  to  visit  the  whole  Preisach  plane  and  identify  the  density  function 
everywhere  during  the  identification  process.  We  assume  that  the  input  range  is 
[■ Umin,umax ]•  hi  Figure  2.5,  the  bigger  triangle  represents  the  set  P  (recall  the 
definition  (2.3)),  while  the  smaller  triangle  is  the  region  f2:l  that  we  can  visit.  The 
region  outside  fR  but  inside  the  set  P  is  denoted  by  F20-  Since  the  input  u(t)  never 
goes  beyond  the  limits,  the  states  of  the  hysterons  in  f20  remain  unchanged.  Thus 
the  bulk  contribution  to  the  output  from  Qq  is  a  constant  and  we  denote  it  by  u0. 

The  input  is  discretized  into  L  +  1  levels  uniformly  (we  will  call  this  discretiza¬ 
tion  of  level  L)  and  we  label  the  cells  in  the  grid  as  illustrated  in  Figure  2.5 
for  L  =  9.  The  Preisach  measure  within  each  cell  is  assumed  to  concentrate  as 
a  discrete  mass  at  the  cell  center.  The  quantities  we  want  to  identify  include 
weighting  masses  Uij,i  =  1,  •  •  •  ,L,j  —  1,  •  •  •  ,  i  and  uq.  To  simplify  the  discussion, 
with  a  slight  abuse  of  notation,  we  write  {r'p}  as  a  column  vector  {z/feffL, ,  where 
I\  =  — ■ L,+i'> .  We  note  that  discretization  of  the  Preisach  plane  leads  to  a  discretized 
Preisach  operator,  which  is  a  weighted  sum  of  K  hysterons  (see  Figure  2.6). 
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Figure  2.5:  Discretization  of  the  Preisach  plane  (L  =  9)  [79]. 

To  initialize  the  states  of  hysterons,  we  decrease  the  input  to  umin-  This  sets 
the  state  of  each  hysteron  in  D|  to  —  1.  We  then  apply  some  piecewise  monotone, 
continuous  input  u(t).  and  measure  the  output  y(t).  The  input  u(t)  should  be 
chosen  in  such  a  way  that  the  contribution  of  each  weighting  mass  can  be  singled 
out,  and  one  candidate  for  such  u(t)  is  the  concatenation  of  the  first,  order  reversal 
inputs.  Signals  u(t),y(t )  are  then  sampled  into  sequences  {u[n]}^=1,  {y[n\}^=1. 
The  input  sequence  {«[n]}  (after  discretization)  is  fed  into  the  discretized  Preisach 
operator  and  the  state  of  each  hysteron,  {^[n]},  A:  =  1,  •  •  •  ,  K,  is  computed.  The 
output  of  the  Preisach  model  at  time  instant  n  can  be  expressed  as: 

K 

y[n]  =  is0  +  Y^  WWW,  (2-9) 

k=  1 

where  { ///,, )  0  Yet  to  be  found. 

We  use  the  least  squares  method  to  estimate  the  parameters,  i.e.,  the  parame- 
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Figure  2.6:  The  discretized  Preisach  operator, 
ters  are  determined  in  such  a  way  that 

N 

^2\y[n)-y[n}\2  (2.10) 

n=l 

is  minimized.  Since  we  require  ig,  >  0, k  =  1, •  •  •  ,I\,  it  is  a  constrained  least 
squares  problem. 

Remark  2.2.1  Theoretically  the  weighting  masses  can  be  computed  directly  from 
the  first,  order  reversal  curves.  This  works  if  the  signals  are  noise-free,  which  is 
usually  not  the  case.  Therefore  we  use  the  least  squares  method. 

2.2.3  Experimental  results 

In  general  the  magnetostriction  depends  on  both  the  mechanical  pre-stress  a  and 
the  magnetic  held  H  [30].  Pre-stress  is  applied  to  the  magnetostrictive  actuator 
through  preloaded  springs  (see  Figure  1.1)  and  that  improves  magnetostriction. 
The  pre-stress  is  not  adjustable  once  the  actuator  is  manufactured,  and  it  does 
not  change  much  during  operation  considering  the  magnitude  of  magnetostriction 
(less  than  1500  parts  per  million  for  Terfenol-D).  Therefore  we  assume  that  the 
magnetostriction  is  only  dependent  on  the  magnetic  held  H. 
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For  the  purpose  of  control,  we  define  the  magnetostriction  A  to  be 


A 


A l_ 

lrod 


(2.11) 


where  lrod  is  the  length  of  the  magnetostrictive  rod  in  the  demagnetized  state,  and 
A l  is  the  change  of  the  rod  length  from  lrod.  The  saturation  magnetostriction  \s  is 
defined  in  an  obvious  way.  Note  our  definition  of  A.,  is  slightly  different  from  that 
in  [23]. 

When  the  input  frequency  is  low,  the  magnetostrictive  hysteresis  is  rate  inde¬ 
pendent:  roughly  speaking,  the  shape  of  the  hysteresis  loop  does  not  depend  on 
the  input  frequency.  In  this  case,  we  can  relate  A  to  the  bulk  magnetization  M 
along  the  rod  direction  by  a  square  law  [82] 


\  =  a,M2. 


(2.12) 


and  relate  the  input  current  I  to  the  magnetic  field  H  (assumed  uniform)  along 
the  rod  direction  by 

H  =  c0I  +  Hbias ,  (2-13) 

where  c0  is  the  so  called  coil  factor ,  and  Hbias  is  the  bias  field  produced  by  per¬ 
manent  magnets  or  a  dc  current.  Hbias  is  necessary  for  generating  bidirectional 
strains.  Hence  we  can  capture  the  hysteretic  relationship  between  A  and  I  by 
the  ferromagnetic  M  —  H  hysteresis.  Venkataraman  employed  a  low  dimensional 
ferromagnetic  hysteresis  model  in  [82].  We  will  use  a  Preisach  operator  to  model 
M  —  H  hysteresis. 


Remark  2.2.2  Due  to  the  thin  rod  geometry,  we  approximate  the  continuum  mag¬ 
netization  in  the  magnetostrictive  rod  by  the  bulk  magnetization.  The  square  law 
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(2.12)  follows  from  the  continuum  theory  of  micromagnetics ,  where  the  magne¬ 
toelastic  energy  is  of  the  form  linear  in  the  strain  and,  quadratic  in  the  direction 
cosines  of  the  magnetization  vector  [22). 

Remark  2.2.3  Mayergoyz  has  shown  that,  the  necessary  and  sufficient  conditions 
for  a  hysteretic  nonlinearity  to  be  represented  by  the  Preisach  model  are  the  wiping- 
out,  property  and  the  congruency  property  [58].  While  the  wiping-out  property  for 
the  ferromagnetic  hysteresis  can  be  directly  verified,  we  will  indirectly  verify  the 
congruency  property  by  a  trajectory  tracking  experiment  based  on  inversion  of  the 
Preisach  operator. 

The  following  parameters  are  available  from  the  manufacturer:  the  saturation 
magnetization  Ms  =  7.87  x  10° A/m,  lrod  =  5.13  x  10_2m,  cq  =  1.54  x  104/m. 
We  can  easily  identify  Xs  =  1.313  x  10“  3  by  applying  an  input  of  relatively  large 
magnitude,  and  then  get  the  coefficient  «L  =  -fh.  The  bias  field  Hbias  is  identified 
to  be  1.23  x  104A/m. 

Given  a  measurement  of  A,  we  compute  M  =  ±y^  and  the  sign  of  M  is 
determined  with  further  information  on  the  input.  The  Preisach  weighting  masses 
can  be  identified  with  the  constrained  least  squares  algorithm  as  described  in  the 
previous  subsection. 

Our  experimental  setup  is  as  shown  in  Figure  2.7  .  DSpace  ControlDesk  is 
a  tool  for  real-time  simulation  and  control.  The  displacement  of  the  actuator  is 
measured  with  a  LVDT  sensor,  which  has  a  precision  of  about  1  /mi. 

The  magnetic  field  input  H  is  limited  to  [1.57  x  103A/m,  3.25  x  104A/m]  and  we 
discretize  the  Preisach  plane  into  25  levels.  Figure  2.8  shows  the  distribution  of  the 
identified  weighting  masses.  The  constant  contribution  i/Q  from  f20  (see  Figure  2.5) 
is  estimated  to  be  4.99  x  105A/m. 
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Figure  2.7:  Experimental  setup. 


Figure  2.8:  Distribution  of  the  Preisach  weighting  masses 


Remark  2.2.4  Due  to  the  bias  field  Hbias  and  the  constraint  on  the  input  current, 
we  can  not  trace  the  major  loop  of  the  M  -  H  hysteresis;  instead  we  can  only  visit 
a  certain  region  inside  the  major  loop.  As  a  result,  the  magneto strictive  hysteresis 
loop  (the  butterfly  curve)  is  asymmetric  (Figure  1.2). 

2.3  Inversion  of  the  Preisach  Operator 

The  general  structure  of  models  for  smart  actuators  that  capture  both  hysteresis 
and  dynamic  behaviour  is  shown  in  Figure  2.9  [85]  .  In  the  figure,  G(s)  represents 
the  transfer  function  of  the  linear  part  in  the  actuator,  while  W  denotes  a  rate- 
independent  hysteretic  nonlinearity.  Venkataraman  [82]  has  shown  that  a  key 
component  of  a  low  dimensional  model  for  magnetostriction  in  Terfenol-D  has  a 
structure  resembling  Figure  2.9  . 

_ i± 

Rate-independent  Linear  system 

hysteresis  operator 

Figure  2.9:  Structure  of  models  for  smart  actuators  [85]. 

A  basic  idea  for  controller  synthesis  for  such  systems  is  to  design  a  right  inverse 
operator  TV-1  for  W  as  shown  in  Figure  2.10.  Then  v(-)  =  v(j  and  the  controller 
design  problem  is  reduced  to  designing  a  linear  controller  K(s)  for  the  linear  system 
G(s). 

In  the  context  of  this  dissertation,  we  consider  W  to  be  a  Preisach  opera¬ 
tor.  The  Preisach  operator  is  highly  nonlinear,  and  in  general,  we  cannot  find 
a  closed-form  formula  for  the  inverse  operator,  unless  the  density  function  is  of 
some  special  form,  as  in  the  work  of  Galinaitis  and  Rogers  [34],  Hughes  and  Wen 
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yref 


Controller 


Figure  2.10:  Controller  design  schematic  [85]. 

[45,  46]  utilized  the  first  order  reversal  curves  in  computing  the  numerical  inverse 
of  the  Preisach  operator.  This  method  relies  on  measurement  of  all  first  order 
reversal  curves  and  involves  solving  nonlinear  equations.  Natale  and  his  colleagues 
proposed  using  another  Preisach  operator  as  a  “pseudo-compensator”  to  approx¬ 
imate  the  inverse  of  a  Preisach  operator  [62],  where  the  Preisach  density  of  the 
compensator  is  identified  with  the  same  set  of  experimental  data  used  in  identi¬ 
fication  of  the  original  Preisach  operator,  but  with  the  roles  of  input  and  output 
swapped.  The  compensator  is  “pseudo"  because  it  is  well  known  that  in  general, 
the  inverse  of  a  Preisach  operator  is  not  a  Preisach  operator.  Venkataraman  and 
Krishnaprasad  [85]  utilized  piecewise  monotonicity  and  Lipschitz  continuity  of  the 
Preisach  operator,  and  proposed  an  inversion  algorithm  based  on  the  contraction 
mapping  principle. 

The  Preisach  operator  is  rate-independent,  and  at  any  time  f,  the  memory  curve 
(and  thus  the  output)  depends  only  on  the  dominant  maximum  and  minimum 
values  in  the  past  input.  Therefore  we  are  mainly  interested  in  the  inversion 
problem  in  the  discrete-time  setting. 
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2.3.1  Inversion  of  the  discretized  Preisach  operator 


First  we  study  inversion  of  a  discretized  Preisach  operator  obtained  as  a  result  of 
input  discretization. 

Let  U  be  the  discrete  control  set,  i.e.,  U  =  { xy ,  1  <  l  <  L  +  1}  with 


Ui  =  Iimin  +  (J  ~  O^u,  where  Su 


^ max  'll min 

L 


Let,  Sn  be  the  set  of  input  strings  of  length  n  taking  values  in  U,  i.e.,  if  s  G  Sn, 
then  s[i]  E  U.  l  <i<  n.  Let  xV(i  be  the  set  of  memory  curves  for  the  discretized 


Preisach  operator. 

Trajectory  Inversion  Problem  of  Length  N:  Given  an  initial  memory 
curve  b’o  €  xl’d  and  a  desired  output,  sequence  y  of  length  N,  find  s*  G  SN ,  such 


that 


.max  |r[.sLro  >'|  “  j/MI  =  min  max  -  y[i] \.  (2.14) 

1<%<N  seS n  1<^<A 

We  call  this  the  trajectory  inversion  problem,  to  distinguish  it  from  the  value 
inversion  problem  we  will  discuss  in  the  next  section. 


Remark  2.3.1  We  put  a  sequence  instead  of  a  continuous  time  function  as  the 
first  argument  ofT  in  (2.1  f).  To  avoid  ambiguity ,  it  is  tacitly  understood  that  the 
input  is  changed  monotonically  from  s[i]  to  s[*  +  1].  Throughout  the  dissertation 
we  may  use  a  sequence  or  a  continuous  time  function  as  the  first  argument  of  T 
depending  on  the  context. 

Remark  2.3.2  A  discretized  Preisach  operator  is  not  “onto"  since  its  output  takes 
values  in  a  finite  set.  Therefore  we  don’t  seek  an  exact  inverse  in  the  problem 
formulation. 
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Before  we  present  the  solution  to  the  problem  above,  we  first  look  at  the  case 
when  N  =  1: 

Trajectory  Inversion  Problem  of  Length  1:  Given  an  initial  memory 

curve  Go  G  'Ifii  and  a  desired  output  yQ,  find  u*  G  U ,  such  that 

\T{u*,Go]  ~  Vo\  =  min  |r[u,  r„)  -  y0\.  (2.15) 

ueu 

There  is  a  simple  algorithm  for  solving  the  problem  of  length  1 ,  which  is  based 
on  the  piecewise  monotonicity  of  the  Preisach  operator  [79].  We  name  it  the  closest 
match  algorithm  because  it  always  generates  an  input  whose  output  matches  the 
desired  output  most  closely  among  all  possible  inputs. 

The  idea  of  the  closest  match  algorithm  is  as  follows.  One  can  obtain  the  initial 
input  w-°>  and  output  i/0'1  from  the  initial  memory  curve  Go-  Consider  the  case 
y(°)  <  fjo  (the  case  y(0)  >  y0  is  treated  in  exactly  the  same  way  with  some  obvious 
modification).  We  keep  increasing  the  input  by  one  level  in  each  iteration  until, 
say  at  iteration  n,  the  input  irn’  reaches  Umax-  or  the  output  i/n>  corresponding  to 
exceeds  y0.  For  the  first  case,  the  optimal  input  is  clearly  Umax ;  for  the  second 
case,  two  candidates  for  the  optimal  input  u*  are  ?dn_b  and  ifn\  We  then  take 
u*  to  be  the  one  with  the  smaller  output  error.  Note  that  we  need  back  up  the 
memory  curve  whenever  we  increase  the  input,  so  that  we  can  always  retrieve  the 
consistent  memory  curve  with  a*. 

The  above  algorithm  yields  the  optimal  input  u*  in  at  most  L  iterations.  And 
in  each  iteration,  the  evaluation  of  y  G  is  very  fast  since  the  input  has  changed  by 
one  level  and  thus  we  need  only  update  states  of  hysterons  corresponding  to  that 
level.  These  factors  combine  to  make  this  algorithm  simple  and  efficient. 

The  trajectory  inversion  problem  of  length  A’  is  solved  by  combining  the  closest 
match  algorithm  and  the  dynamic  programming  principle  [13]. 
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Let  Ed  :  xVd  x  U  — ►  be  the  evolution  map  for  the  memory  curve,  i.e.,  if 

ib  G  vI/d  is  the  initial  memory  curve,  then  E d(u,ip)  is  the  new  memory  curve  when 
the  input  u  G  U  is  applied. 

Given  N  and  the  sequence  y,  for  I  <  k  <  N,  we  define 

JM,  s)  =  max  |r[.s,L’][i]  —  y[i\ |,  s  G  SN~k+1,  (2.16) 

k<i<N 

W)=  min  .4(®.s).  (2.17) 

where  we  call  the  cost  function  and  14  the  value  function. 

Proposition  2.3.3  The  value  functions  Vk ,  1  <  k  <  Ah  can  be  solved  successively 
via: 


=  min  |r[a,t/;]  —  y[Ar]|,  (2-18) 

u€l ' 

=  min max{|r[u,  p]  -  y[A:]|,  Pfc+i (Sd(«,  i/j))}.  (2.19) 

uEu 

Define  maps  1 r£  :  — »■  C/,  1  <  h  <  N .  so  that  tt (;(-0)  hs  the  argmin  in  (2.18)  and 

(2.19).  Then  for  the  trajectory  inversion  problem  of  length  N,  1 T*k)  1  <  k  <  N, 
gives  the  optimal  control  policy  at  time  k. 

Proof  Straightforward  from  Bellman’s  optimality  principle.  ■ 

The  closest  match  algorithm  can  be  used  in  solving  (2.18)  and  (2.19).  Propo¬ 
sition  2.3.3  entails  pre-computing  and  storage  of  the  optimal  maps,  which  is  un¬ 
desirable  when  N  or  the  cardinality  of  is  large.  A  sub-optimal  approach  is  to 
decompose  the  inversion  problem  of  length  N  into  N  successive  inversion  problems 
of  length  1  and  solve  them  using  the  closest  match  algorithm.  The  experimental 
result  of  trajectory  tracking  based  on  this  approach  can  be  found  in  [79]. 
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2.3.2  Inversion  of  the  Preisach  operator  with  nonsingular 


measure 


We  now  discuss  the  inversion  problem  for  a  Preisach  operator  with  nonsingular 
Preisach  measure.  In  this  case,  the  Preisach  operator  can  be  inverted  with  arbitrary 
accuracy,  and  it  suffices  to  study  an  inversion  problem  of  length  1 :  given  Wo  £  h 
and  A 1  £  [A/mm,  PLrao.f .  find  H  £  \Hmin,  Hmaxf  such  that 

M  =  T[H,tl>0], 

where  [Hmin,  Hmax\  and  [Mmin,  Mmax }  are  the  ranges  of  the  input  and  the  output  of 
the  Preisach  operator,  respectively.  The  notation  used  in  this  subsection  is  slightly 
different  from  that  in  Subsection  2.3.1,  but  it  will  be  consistent  with  the  notation 
in  Chapter  3. 


Proposition  2.3.4  Let  the  Preisach  measure  be  nonnegative  and  nonsingular  with 
a  density  function  //..  Let 

POL  0 

//(./.  n  )<Li.  sup  /  n(@,  a)da}  <  oc.  (2.20) 

P  J  P 


Let  the  current  memory  curve  be  ho,  and  let  the  input  and  the  output  of  the  Preisach 


operator  corresponding  to  ip0  be  H0  and  Mo,  respectively.  Given  M  £  \Mrnin,  Mrnax\, 
consider  the  following  algorithm: 


{//!«-!)  _  jj{n)  _|_  M-M{r 
A/(n+i)  = 


(2.21) 


where  =  ip o,  Hi(f>  =  Ho,  A/(0^  =  M0,  and  ffn'>  is  the  memory  curve  after 


{/f(fc)}«=i  is  applied.  Then  A/C-)  — >  M  as  n  — >  oo. 


Proof  The  proposition  follows  directly  from  the  piecewise  monotonicity  property 
and  the  continuity  property  of  the  Preisach  operator.  ■ 
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Remark  2.3.5  The  algorithm  (2.21)  also  appeared  in  [85],  where  approximate 
inversion  of  the  Preisach  operator  was  studied  for  the  class  of  continuous,  piecewise 
monotone  functions. 

What  we  have  identified  in  Subsection  2.2.2  is  a  set  of  Preisach  weighting 
masses,  which  forms  a  singular  Preisach  measure.  We  can  obtain  a  lionsingular 
Preisach  measure  up  by  assuming  that  each  identified  mass  is  distributed  uniformly 
over  the  corresponding  cell  in  the  discretization  grid.  Note  that  the  diagonal  cells 
are  triangular,  while  other  cells  are  square  (refer  to  Figure  2.13(a)).  The  density 
function  pp  corresponding  to  up  is  piecewise  uniform,  which  enables  us  to  solve  the 
inversion  problem  exactly,  as  described  next. 

We  consider  the  case  M  >  M0  and  the  other  case  can  be  treated  analogously. 
It’s  obvious  that  H  >  H(}  and  we  will  increase  the  input  in  every  iteration.  At 
iteration  n,  let  d-"  >  0  be  such  that  H{n)  +  d\n)  equals  the  next  input  level,  and 
let  d<2  >  0  be  the  minimum  amount  such  that  applying  +  d%  will  eliminate 
the  next  corner  of  the  memory  curve  (see  Figure  2.11  for  illustration).  Since  pp  is 
piecewise  constant,  for  d  <  min {d(|,,),  dj>,,)},  we  have 

r [R('l)  +  d.  -  r[//(n),T’(n)]  =  4n)d2  +  a[n)d, 

where  a\n\alfl'>  >  0  can  be  computed  from  pp,  and  the  square  term  is  due  to  the 
contribution  from  the  triangular  region  inside  the  diagonal  cell.  Let  d$  be  such 
that 

M  -  \"Hi").ci")}  =  4n)(df,n))2  +  a'C'd'f". 

The  inversion  algorithm  now  works  as  follows: 

|  dfn =  inin{ c/q"  ' ,  d\n) ,  d^} 

H(n+1)  =  H(n)  +  d(n) 

A/(n+ 1)  =  r[R(n+1),T>(n)] 
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(2.22) 


Figure  2.11:  Illustration  of  d\n)  and  d^l\ 

If  at  iteration  n*,  d(n'^  =  <4”  \  then  the  iteration  stops  and  H  =  fTn*+0_  Let 
nc(ip o)  be  the  number  of  corners  of  Cq.  and  L  the  discretization  level  of  the  Preisach 
plane.  It’s  easy  to  see  the  algorithm  (2.22)  yields  the  (exact)  solution  in  no  more 
than  n  =  nc(ipo )  +  L  iterations. 

Figure  2.12  shows  the  result  of  an  open-loop  tracking  experiment  using  the 
algorithm  (2.22).  The  desired  trajectory  was  obtained  from  the  output  of  a  Van 
der  Pol  oscillator  to  make  the  tracking  task  more  challenging.  In  Figure  2.12,  the 
displacement  trajectories  (both  the  desired  and  the  measured),  the  tracking  error 
and  the  input  current  are  displayed.  The  overall  performance  is  satisfactory  since 
the  error  magnitude  is  less  than  4  /mi  most  of  the  time  with  a  tracking  range 
of  60  /mi.  We  can  see  that  the  tracking  error  slightly  exceeds  4  /mi  when  the 
desired  output  (and  thus  the  input)  undergoes  abrupt  changes,  in  which  case  the 
rate-independence  assumption  no  longer  holds. 
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2.4  The  Value  Inversion  Problem  and  Its  Appli¬ 
cation  to  Micro-Positioning  Control 


By  inverse  compensation,  one  usually  refers  to  the  trajectory  inversion  problem: 
given  a  desired  output  trajectory,  compute  an  input  trajectory  whose  correspond¬ 
ing  output  trajectory  matches  the  desired  one.  In  many  applications,  such  as 
micro-positioning,  we  are  more  interested  in  the  following  problem:  given  a  de¬ 
sired  output  value ,  find  an  input  trajectory  such  that  the  final  value  of  the  output 
matches  the  desired  value.  To  distinguish  this  problem  from  the  trajectory  inver¬ 
sion  problem,  we  call  it  the  value  inversion  problem.  This  problem  has  been  well 
studied  for  linear  systems,  but  to  our  best  knowledge,  very  little  has  been  done  in 
the  context  of  hysteretic  systems. 

The  Preisach  operator  becomes  a  finite  state  machine  (FSM)  after  discretiza¬ 
tion,  and  the  value  inversion  problem  can  be  transformed  into  a  state  reachability 
problem  for  the  FSM.  We  show  that  the  FSM  is  reachable  and  indicate  how  to 
construct  the  input  sequence  for  the  state  transition.  After  observing  that,  for 
practical  reasons,  there  may  exist  a  large  number  of  equivalent  states  in  the  FSM, 
we  propose  a  state  space  reduction  scheme,  which  can  significantly  save  storage 
space  and  computation  time.  An  algorithm  for  generating  the  optimal  (the  sense  of 
“optimality”  will  be  clear)  representative  state  in  each  equivalent  class  is  presented. 

2.4.1  The  value  inversion  problem 

In  any  practical  identification  scheme  for  the  Preisach  measure,  a  discretization 
step  is  involved  in  one  form  or  another.  Figure  2.13(a)  shows  our  discretization 
scheme  used  earlier  in  measure  identification,  where  the  Preisach  measure  inside 
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each  cell  is  assumed  to  concentrate  at  the  cell  center  (represented  by  dark  dots  in 
Figure  2.13(a)).  As  noted  in  Subsection  2.2.2,  this  results  in  a  discretized  Preisach 
operator.  We  note  that  although  uniform  discretization  is  considered  here,  the 
results  of  this  section  apply  directly  to  the  case  of  non-uniform  discretization. 


Figure  2.13:  (a)  Discretization  of  the  Preisach  plane  (L  =  3);  (b)  Memory  cuve 
“001”  (bolded  lines). 

Let  S  be  the  set  of  input  strings  taking  values  in  U,  where  U  is  as  defined  in 
Subsection  2.3.1.  Let  Sa  be  the  set  of  alternating  input  strings  [20]  in  U,  in  the 
sense  that,  if  sa  €  Sa,  then  (sa[i  +  2]  —  sa[i  +  l])(sa[i  +  1]  —  sa[i])  <0,  Vi  >  0.  The 
value  inversion  problem  is  formulated  as: 

Value  Inversion  Problem:  Given  a  desired  output  value  ijo  and  an  initial 
memory  curve  Do  €  dy,  find  s*  G  Sa ,  such  that 

I'/K-'  oi  -  Vo\  =  min  \Tf{sa,fi0}  -  y0 1,  (2.23) 

sa^SA 

where  T/[s,Do]  denotes  the  final  value  of  the  output  of  the  Preisach  operator  under 
input  sequence  s.  If  there  is  more  than  one  such  string  achieving  (2.23),  find  the 
one  with,  the  minimum  length. 
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Remark  2.4.1  Any  s  £  S  can  be  reduced  to  some  sa  £  Sa  using  the  following 
rules ,  starting  from,  i  =  1:  if  (s[*  +  1]  —  s[*])(s[*  +  2]  —  s[i  +  1])  >  0,  delete 
s [i.  +  1]  and  re-index.  For  example ,  s  =  (tti,  M3,  U3,  U5,  U4,  U2)  £  S  can  be  reduced  to 
.sa  —  (ui, 0,5,112)  £  Sa-  The  final  values  of  the  output  under  s  and  sa  are  identical 
( easy  to  verify),  hence  we  only  need  search  the  optimal  input  sequence  in  Sa- 

Remark  2.4.2  The  length  of  an  alternating  input,  string  is  directly  linked,  to  the 
number  of  input  reversals  and  thus  the  complexity  of  implementing  that  input. 
Therefore  we  seek  s*  with  the  minimum  length. 

The  discretized  Preisach  operator  can  be  treated  as  a  finite  state  machine 
(FSM).  Since  there  are  I(I,+  li  hysterons  for  a  discretized  Preisach  model  with 
discretization  level  L  and  the  output  of  each  hysteron  takes  values  in  {—1,1},  the 
number  of  states  appears  to  be  2i(i+|h2.  This  is  not  the  case  in  general,  recalling 
that  the  true  state  is  the  memory  curve. 

Proposition  2.4.3  For  a  discretized  Preisach  operator  with  discretization  level  L, 
the  number  of  states  is  2L. 

Proof  In  the  (/3,a)  coordinates,  each  memory  curve  consists  of  L  horizontal  or 
vertical  segments  of  length  Su,  so  the  total  number  of  memory  curves  is  2L.  ■ 

The  proof  motivates  an  indexing  scheme  for  the  memory  curve.  Starting  from 
the  upper  left  corner,  we  number  each  memory  curve  with  L  bits  corresponding  to 
the  L  segments:  0  represents  vertical,  and  1  represents  horizontal.  For  instance, 
the  memory  curve  represented  by  the  bolded  lines  in  Figure  2.13(b)  reads  “001”. 
We  can  now  give  a  complete  description  for  the  FSM.  It  has  state  space 

A>d  =  {if  :  if  =  (aL,  ql_1,  •  •  •  ,  q1),  od  £  {0, 1},  j  =  1,  •  •  •  ,  L} 
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and  input  space  U.  It  is  a  state  output  automaton  [15]  since  the  output  y  of 
the  Preisach  operator  depends  only  on  the  memory  curve  ip.  Therefore  the  value 
inversion  problem  is  solved  if  any  state  of  the  FSM  is  reachable ,  because  then  all 
we  have  to  do  is  to  find  the  state  whose  corresponding  output  is  closest  to  the 
desired  y0- 

The  state  transition  function  :  'fo  x  U  — »  can  be  best  described  in  terms 
of  two  state  operations,  INC:  and  DEC:  4 td  — ,  \l>d.  For  any  state 

if  G  dfo,  we  can  immediately  determine  the  current  input  u(if):  u(if)  =  un+\  if  if 
contains  n  “l”s.  For  if  G  we  define 


INC  (if) 


if)  if  u{if)  =  Ul+i 

the  state  after  the  input  is  increased  by  one  level  if  u(if)  ul+. i. 


and 


DECfip) 


if)  if  u(if)  =  Ui 

the  state  after  the  input  is  decreased  by  one  level  if  u(if)  ^  U\ 


As  one  can  easily  verify,  INC  changes  the  first  “0”  bit  counting  from  the  right  to 
“1”  and  leave  other  bits  untouched.  A  symmetric  remark  applies  to  the  operation 
DEC.  Therefore  bit  L  (bit  1,  resp.)  is  the  most  (least,  resp.)  important  bit,  in 
the  sense  that,  if  you  want  to  switch  bit  j  from  0  (1,  resp.)  to  1  (0,  resp.),  you 
must  first  switch  all  the  lower  bits  to  1  (0,  resp.).  Figure  2.14  illustrates  the  INC 
and  DEC  operations  for  the  case  of  L  =  3. 

Now  given  u  G  U,  the  state  transition  function  is  expressed  as: 

if,  if  u  —  u(if>)  =  0 
INC  o  •  •  •  INCfiff),  if  u  —  u{ib)  =  nSu 

S  NX  ^ 

n  INCs 

DEC  o  •  •  •  D It C fif),  if  u  —  u(if))  =  —n8u 

s  v-  / 

n  DEC'S 


34 


Figure  2.14:  Operations  INC  and  DEC  for  L  —  3. 
where  “o”  denotes  composition  of  functions. 

Proposition  2.4.4  Any  state  is  reachable.  Let  i  =  1,2,  be  two  states.  Let 
bit  no  be  the  leftmost,  bit  at  which  ipi  and,  ip2  differ,  and  let  na  be  the  number  of 
alternating  bit  pairs  in  tp2  from  bit  no  through  bit  1.  Then  tp2  is  reachable  from  iji\ 
by  applying  an  input  string  s*a  E  SA  of  length  na  +  1,  and  the  length  of  any  other 
sa  E  Sa  achieving  the  state  transition  from  ib\  to  is  no  less  than  na  +  1. 

The  proposition  is  a  straightforward  consequence  of  the  state  transition  map 
The  following  example  illustrates  the  proposition  as  well  as  how  to  actually  con¬ 
struct  the  input  string. 

Example  2.4.5  Assume  L  =  5,  'tp\  —  00100,  ip2  =  01011.  Then  n0  =  4,  na  =  2, 
and  the  alternating  input  string  s*  for  achieving  the  state  transition  has  length  3. 
Now  let’s  detail  the  procedure  of  state  transition: 

•  Step  0.  ip i  contains  one  “1”,  so  the  current  input  value  is  u2; 

•  Step  1.  Apply  it5  (3  consecutive  INCs)  to  make  bit  4  "1”  and  the  state 
becomes  01111; 

•  Step  2.  Apply  u2  (3  consecutive  DECs)  to  make  bit  3  “0”  and  the  state 
becomes  01000; 
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•  Step  3.  Apply  U4  (2  consecutive  INCs )  to  get  ip2. 

Remark  2.4.6  A  state-space  representation  of  a  general  Preisach  operator  can  be 
found  in  [37]  and  it  is  shown  there  that  the  state  space  is  approximately  reachable , 
see  Proposition  3.4.10.  This  “approximate  reachability”  result  was  also  stated  in 
[58,  86]  (in  a  more  casual  way). 

Corollary  2.4.7  Any  state  is  reachable  from  any  other  state  with  some  s*  €  Sa 
of  length  no  more  than  L. 

2.4.2  A  state  space  reduction  scheme 

Reduction  of  the  state  space 

In  general  we  need  store  output  values  of  2L  states  for  the  value  inversion  problem. 
For  a  reasonable  discretization  level  L,  this  may  take  lots  of  memory.  In  addition, 
computation  cost  for  sorting  and  searching  these  states  will  be  very  high.  There¬ 
fore  reducing  the  number  of  states  without  compromising  control  accuracy  is  of 
practical  interest. 

Two  states  V-’i ,  '02  €  d-h/  are  equivalent  denoted  as  f)\  =  -02,  if 

I  s.  ri]  =  l>.r,].  Vs  e  S. 

We  say  a  hysteron  in  the  discretized  Preisach  operator  is  non- trivial  if  its  associated 
weight  is  not  zero,  and  is  trivial  otherwise.  Existence  of  trivial  hysterons  leads  to 
equivalent  states.  Let’s  look  at  an  example.  In  Figure  2.15(a),  the  hysterons 
marked  with  “•’’(and  labeled  by  ,75)  are  assumed  to  be  lion-trivial  and 

those  marked  with  “o”  are  assumed  to  be  trivial.  It’s  easy  to  verify  that  the 
following  states  in  Figure  2.15(a)  are  equivalent:  0101,  0110,  1001  and  1010. 
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Figure  2.15:  (a)  Existence  of  equivalent  states  (L  =  4);  (b)  Illustration  of  the 
shaded  set. 

For  % ')  G  d-h/,  define  S(ip)  to  be  the  set  of  lion-trivial  hysterons  underneath 
the  memory  curve  corresponding  to  ip.  From  the  example  above,  we  can  see  that 
ip i  =  ip2  if  and  only  if  S(ip i)  =  <S(-t '•_>)•  From  the  experimental  result  of  measure 
identification  (see  Figure  2.8),  we  see  that  indeed  many  hysterons  carry  weights  of 
zero  or  close  to  zero,  and  this  provides  room  for  the  state  space  reduction. 

The  original  state  space  4/ d  is  thus  a  disjoint  union  of  equivalent  classes  of 
states.  A  reduced  state  space  4/  is  obtained  such  that  each  element  in  T  is  an 
equivalent  class  in  4^,  i.e.,  =  4^/  =.  Denote  the  set  of  non-trivial  hysterons 

as  Af,  i.e.,  A f=  {7 p>a  :  ui3a  >  0},  where  u^a  is  the  weight  of  7^.  Then  a  subset 
ip  of  J\f  can  be  identified  with  a  member  of  'F  if  and  only  if  3  ip  G  4b/,  such  that 
ip  =  S(ip).  To  better  capture  the  latter  property,  we  introduce  the  notion  of  a 
Lower-Left-Shaded  Set  .  The  Lower-Left- Shaded  Set  (abbreviated  as  the  shaded 
set  hereafter)  A(fi3Xi)  of  a  hysteron  7-o>  G  N  is  defined  to  be 

-4(7/3, a)  =  {l0’,a’  €  A f  :  7/3', a!  7^  7 P,cti  ft  Si  0,  Ol'  <  «}. 

The  geometric  interpretation  of  the  shaded  set  of  is  clear:  imagining  two  rays 
from  7/3, a  in  the  Preisach  plane,  one  pointing  downwards  and  the  other  to  the 
left,  the  shaded  set  consists  of  non-trivial  hysterons  lying  between  the  two  rays. 
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For  example,  in  Figure  2.15(b),  .4.(75)  =  {71,72,73}-  If  7 p,a  lies  underneath  some 
memory  curve  ip',  all  elements  of  A{ji3M)  must  do  so  too.  Therefore  we  conclude 
that  ip  C  Af  is  identified  with  a  member  of  T  if  and  only  if  the  following  holds: 

A(%,a)  C  ip  ,  V  7/t a  G  '0-  (2.24) 

To  ease  presentation,  from  now  on  we  will  simply  write  ip  £  T  if  (2.24)  is  satisfied. 
Now  we  can  list  all  members  in  T  using  a  tree-structured  algorithm: 

•  Step  1.  List  the  equivalent  class  having  110  11011-trivial  hysterons  (negative 
saturation); 

•  Step  2.  List  equivalent  classes  with  one  constituent  lion-trivial  hysteron, 
i.e.,  the  shaded  set  of  every  such  hysteron  is  empty; 

•  Step  3.  Starting  from  each  equivalent  class  (parent  class )  ip  with  n  non¬ 
trivial  hysterons,  we  list  equivalent  classes  ( children  classes )  with  n  +  1  non¬ 
trivial  hysterons  by  finding  another  hysteron  7  6  M  such  that: 

—  7  is  not  included  in  ip, 

—  A( 7)  C  ip,  i.e.,  ip  U  7  is  an  eligible  member  of  T,  and 

—  ip  U  7  does  not  coincide  with  any  other  equivalent  class  ip'  with  n+  1 
constituent  hysterons  that  has  been  listed  so  far; 

•  Step  4.  Continue  Step  3  until  ip  =  J\f  (positive  saturation)  is  listed. 

The  equivalent  classes  are  sorted  according  to  their  output  values  during  the 
above  enumeration  process,  and  we  save  computation  time  by  using  the  fact  that 
the  output  of  a  child  class  is  always  greater  than  that  of  its  parent. 


Generating  best  representative  states 


For  the  purpose  of  realizing  state  transition,  we  need  find  a  representative  state 
ib  G  dfi/  for  every  ib  G  T.  From  Proposition  2.4.4,  the  number  of  alternating  bit 
pairs  of  a  state  ib  is  closely  related  to  the  number  of  input  reversals  required  for  the 
state  transition.  Therefore  the  best  representative  state  bo*  G  vFf/  for  ip  G  VP  should 
have  the  least  number  of  alternating  bit  pairs  among  all  states  in  the  equivalent 
class  ij). 

We  generate  a  representative  ib*  for  ib  G  T  by  first  drawing  two  memory  curves 
%b\  and  'tb'b  and  then  picking  ib*  to  be  the  one  whose  number  of  alternating  bit  pairs 
is  less.  When  we  draw  a  memory  curve,  at  most  two  directions  are  possible  for 
each  segment:  going  downwards  (denoted  by  “j”)  or  going  to  the  right  (denoted 
by  ijT  is  obtained  as  follows:  start  from  the  left  upper  corner  with  “j” 

and  continue  that  direction  as  long  as  it  is  feasible  to  do  so  (i.e.,  no  constituent 
hysteron  of  ib  is  left  out):  when  it  is  infeasible  to  continue  “j”,  we  switch  to  >” 
and  keep  going  with  that  direction  until  it  is  infeasible  for  ib  (i.e.,  non-constituent 
hysterons  will  be  included).  We  continue  until  all  L  segments  are  drawn.  Similarly 
we  obtain  ib*b,  by  starting  with  .  Note  it’s  easy  to  see  that  ►”  is  feasible 
whenever  “J,”  is  not,  and  vice  versa. 

Proposition  2.4.8  The  representative  ib*  obtained  in  the  above  scheme  has  the 
least  number  of  alternating  bit  pairs  among  all  states  in  the  equivalent  class  if. 

Proof  For  any  state  ib  starting  with  “j” ,  we  can  show  its  number  of  alternating 
bit  pairs  is  no  less  than  that  of  ib*  by  exploiting  the  strategy  in  generating  ib*. 

Instead  of  giving  a  general  proof,  we  will  illustrate  the  essential  idea  by  looking 
at  a  concrete  example  with  discretization  level  L  =  8  (Figure  2.16).  In  the  figure, 
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Figure  2.16:  Illustration  of  the  proof  of  Proposition  2.4.8. 

we  assume  that  the  memory  curve  represented  by  the  bolded  lines  A-B-C-D-E 
(“00111001”)  is  ip\.  Let  ip  be  any  other  state  in  the  same  equivalent  class  ip 
starting  with  “j”.  Now  imagine  we  are  growing  the  two  curves  ip{  and  ip  segment 
by  segment,  starting  from  the  left  upper  corner.  The  curve  ip  has  to  switch  to  ►” 
no  later  than  it  reaches  the  point  B  (since  otherwise  it  will  be  infeasible).  This 
implies  that  when  we  encounter  the  first  alternating  bit  pair  in  ip*L>,  we  must  have 
encountered  at  least  one  alternating  bit  pair  in  ip.  For  the  same  reason,  ip  has  to 
switch  to  “j”  before  ip\  does  so  at  point  C.  This  argument  goes  on  until  we  hit 
the  line  a  =  ,3  and  stop.  Therefore  whenever  one  alternating  bit  pair  occurs  in 
ip*,  at  least  one  alternating  pair  occurs  in  ip.  Hence  the  number  of  alternating  bit 
pairs  in  ip  is  no  less  than  that  in  ip*.  The  curve  represented  by  the  dashed  lines 
A-F-G-H-I-E  in  Figure  2.16  gives  an  example  of  such  ip. 

Analogously  for  any  state  ip  starting  with  we  can  show  its  number  of 

alternating  bit  pairs  is  no  less  than  that  of  ip*_^.  The  proof  is  now  complete.  ■ 

Example  2.4.9  For  the  equivalent  class  {71,72,73}  in  Figure  2.15(a),  ipl  =  0110 
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with  2  alternating  bit  pairs  and  ip*P  =  1001  with  the  same  number  of  alternating 
bit  pairs.  So  ip*  =  ipf  (or  ip*_P). 

2.4.3  Experimental  results 

We  now  apply  the  value  inversion  scheme  and  the  state  space  reduction  scheme 
to  micro-positioning  control  of  a  magnetostrictive  actuator.  The  Preisach  plane  is 
discretized  with  L  —  25,  which  results  in  300  hysterons.  By  treating  201  hysterons 
whose  weights  are  zero  or  very  small  as  trivial,  we  are  left  with  99  non-trivial  hys¬ 
terons.  The  final  number  of  states  in  the  reduced  state  space  is  99,217,  compared 
with  33,554,432,  the  number  of  states  in  the  original  state  space. 

Given  a  sequence  of  8  desired  displacement  values  (10  /mi,  30  /mi,  15  /mi, 
40  /mi,  20  /mi,  40  /mi,  60  /mi  and  50  /mi),  we  want  to  drive  the  actuator  head  to 
these  positions  consecutively.  Three  control  schemes  are  implemented  to  achieve 
the  positioning  goals.  The  first  one  is  based  on  the  value  inversion  scheme,  the 
second  one  is  based  on  the  closest  match  algorithm,  and  the  third  scheme  is  based 
on  a  non-hysteretic  model  where  the  input-output  relationship  is  approximated  by 
a  single-valued  function  y  =  —7.441*  —  2.637"  +  40.81/  +  30.34.  The  current  input 
and  the  measured  displacement  are  shown  in  Figure  2.17  through  Figure  2.19. 
We  intentionally  hold  the  input  current  constant  for  about  1  second  after  each 
positioning  is  completed.  Figure  2.20  compares  the  errors  of  the  three  schemes  for 
the  eight  positioning  tasks.  We  see  that  Scheme  1  yields  the  minimum  positioning 
error.  As  a  trajectory  inversion  algorithm,  Scheme  2  does  not  allow  input  reversals 
for  each  desired  output  value  and  thus  has  less  control  freedom  than  Scheme  1 
does.  This  can  explain  why  scheme  1  is  better  than  scheme  2.  Scheme  3  delivers 
the  worst  performance  because  hysteresis  is  not  taken  into  account. 
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Figure  2.17:  Micro-positioning  control  based  on  the  value  inversion  scheme. 
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Figure  2.18:  Micro-positioning  control  based  on  the  closest  match  algorithm. 
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Figure  2.19:  Micro-positioning  control  based  on  a  non-hysteretic  model. 


Figure  2.20:  Comparison  of  three  schemes.  Scheme  1:  the  value  inversion  algo¬ 
rithm;  Scheme  2:  the  closest  match  algorithm;  Scheme  3:  the  inversion  algorithm 
based  on  a  non-hysteretic  model. 
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Chapter  3 


A  Dynamic  Model  for 
Magnetostrictive  Hysteresis 


When  the  input  frequency  gets  high,  the  magnetostrictive  hysteresis  is  rate  de¬ 
pendent1  (Figure  3.1)  due  to  the  eddy  current  effect  and  the  magnetoelastic  dy¬ 
namics  of  the  actuator  rod  [82,  83].  The  rate-dependent  hysteresis  can  no  longer 
be  modeled  by  a  Preisach  operator  alone.  In  this  chapter,  we  propose  a  novel 
dynamic  model  for  the  magnetostrictive  hysteresis,  consisting  of  a  Preisach  oper¬ 
ator  coupled  to  an  ordinary  differential  equation  (ODE)  in  an  unusual  way.  Due 
to  its  special  structure,  the  model  presents  interesting  problems  in  analysis  and 
computation.  We  establish  the  well-posedness  of  the  model  and  study  its  various 
system-theoretic  properties.  Existence  of  periodic  solutions  is  proved.  Numeri¬ 
cal  integration  schemes,  parameter  identification  methods  and  an  inverse  control 
scheme  are  presented. 

1In  some  literature,  e.g.,  [86,  20],  the  word  hysteresis  is  referred  to  rate-independent  memory 
effects  only.  We  use  “hysteresis”  in  the  more  general  sense  in  this  dissertation. 
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3.1  A  Dynamic  Hysteresis  Model 


Venkataraman  and  Ivrishnaprasad  proposed  a  bulk  magnetostrictive  hysteresis 
model  based  on  energy  balancing  principles  [82,  83].  The  model  has  a  cascaded 
structure  as  shown  in  Figure  3.2.  The  block  W  takes  care  of  the  M  —  H  hysteresis 
and  the  eddy  current  losses.  G(s)  is  a  lumped  second  order  linear  system  modeling 
the  magnetoelastic  dynamics  of  the  rod. 


Figure  3.2:  Model  structure  of  a  magnetostrictive  actuator. 


We  now  have  a  closer  look  at  the  block  W.  Due  to  the  finite  resistivity  of  the 
magnetostrictive  material,  there  are  eddy  currents  circulating  inside  the  rod.  One 
way  to  represent  the  eddy  current  losses  is  to  place  a  resistor  Reddy  in  parallel  with 
a  hysteretic  inductor  [23,  82],  as  shown  in  Figure  3.3.  We  note  that  this  is  a  phe¬ 
nomenological  approach  and  the  underlying  details  of  the  eddy  current  dynamics 
are  ignored  here.  Considering  the  thin  structure  of  the  rod,  we  assume  that  the 
magnetic  flux  density  B  is  uniform  over  the  cross  section  of  the  magnetostrictive 
rod.  Then  the  voltage  V  across  the  nonlinear  inductor  is  NmAm where  Nm  is 
the  number  of  turns  of  the  coil,  and  Am  is  the  cross  sectional  area.  Let  /  be  the 
input  current  applied,  and  I\  be  the  current  flowing  in  the  inductor  branch.  Since 
v  =  (/  -  h) Reddy,  we  have 


dB 

dt 


Reddy  (  T  T  , 

1  ymJ*-rn 


(3-1) 


Iu  SI  units,  B  =  Ho(H  +  M),  where  //0  =  47t  x  10  7Henry/m  is  the  permeability 
of  vacuum.  H  is  related  to  / 1  via  H  —  coh,  where  Cq  is  the  coil  factor.  The 
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Figure  3.3:  Representation  of  eddy  current  losses  in  a  magnetostrictive  actuator 
[82]. 

constitutive  relationship  between  M  and  H  was  modeled  by  a  low  dimensional  bulk 
ferromagnetic  hysteresis  model  in  [82]  and  that  led  to  an  overall  model  described  by 
switching  ordinary  differential  equations.  We  use  a  Preisach  operator  T  to  model 
M  —  H  hysteresis  and  obtain  the  following  new  model  for  the  block  W: 

|  H  +  M  =  Cl(I-Z) 

{  Co  ,  (3.2) 

<  M  =  T[H,ip0] 

where  ipo  represents  the  initial  memory  curve  and 

^  A  Reddy 
Ro  R'm-Am 

G(s)  has  a  state  space  representation  [82,  83]  (after  some  manipulations): 

V(t)  +  2 €u)oy(t)  +  u%y(t)  =  ' M2(l ).  (3.3) 

where  y  is  the  displacement,  u0  =  2i r/0,  fo  is  the  first  resonant  frequency  of 
the  actuator,  £  is  the  damping  coefficient,  lrod  is  the  length  of  the  rod,  Xs  is  the 
saturation  magnetostriction  and  Ms  is  the  saturation  magnetization. 
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Note  if  we  set  derivatives  in  (3.2)  and  (3.3)  to  zero,  the  dynamic  model  degen¬ 
erates  to  the  static  hysteresis  model  we  have  discussed  in  Chapter  2: 


H(t)  =  CqI  (t) 

<  .!/(/)  -  r//(-).r„|(/)  •  (3-4) 

y(t)  = 

lvJs 

3.2  Well-posedness  of  the  Model 

Eq.  (3.3)  is  just  an  ODE,  therefore  we  will  focus  on  the  well-posedness  of  (3.2). 


3.2.1  Existence  and  uniqueness 

Eq.  (3.2)  involves  time  derivatives  of  both  H  and  M.  It  is  well  known  that,  in 
general,  a  Preisach  operator  does  not  map  C1  into  Cl.  Indeed,  when  corners  in 
the  memory  curve  are  eliminated,  discontinuities  occur  in  the  output  derivative  if 
the  Preisach  measure  does  not  vanish  in  a  neighbourhood  of  the  corner  [86].  Hence 
we  will  interpret  (3.2)  in  the  sense  of  Caratheodory  [87].  Some  partial  differential 
equations  with  hysteretic  operators  appearing  in  the  principal  parts  have  been 
studied,  see  [86,  20]  and  references  therein.  Existence  and  uniqueness  proof  of 
solutions  to  equations  of  the  form 

V  =  f(t,y,T(y)),  (3.5) 

where  P  is  some  hysteresis  operator,  can  be  found  in  [20].  To  our  best  knowledge, 
no  such  result  has  been  published  for  equations  like  (3.2). 

The  following  lemma  will  be  used  in  the  proof  of  uniqueness  of  the  solution  to 
(3.2). 
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Lemma  3.2.1  (The  Gronwall  inequality)  [24]  Let  X(t.)  be  a  real  continuous 

function  and  v(t)  a  nonnegative  continuous  function  on  the  interval  [a,  b] .  If  a 
continuous  function  y(t)  has  the  property  that 

y(t)  <  A (t)  +  j  u(s)y(s)ds 
J  a 

for  a  <  t  <  b,  then  on  the  same  interval 

r-t 

y{t)  <  X(t)  +  /  A (s)is(s)ef>(T)dTds. 

J  a 

In  particular,  if  \{t)  =  A0,  where  A0  is  a  constant.,  then 

y(t )  <  A,,' 


Theorem  3.2.2  If  the  Preisach  measure  u  is  nonnegative  and  nonsingular,  and 
/(•)  is  piecewise  continuous,  then  for  any  0O  G  for  any  T  >  0,  there  exists  a 
unique  pair  {Hi-),  Mf)}  €  C'([0,7’])xC'([0.T])  satisfying  (3.2)  almost  everywhere. 


Proof  1.  We  first  show  the  existence.  From  t/;0,  one  can  evaluate  initial  values 
77(0)  and  M( 0).  Eq.  (3.2)  is  equivalent  to  the  following:  Vt  G  [0,T], 


|  H(t)  +  M(()  =  H( 0)  +  A/(0)  +  pc,(I(s)  -  B&)ds 


As  in  the  proof  of  the  existence  of  solutions  to  the  heat  equation  with  hysteresis 
in  [20],  we  use  an  Euler  polygon  method  to  approximate  (3.2):  for  N  >  0  and 
In  =  y,  solve  consecutively 


jj(rn+l)  rr(m) 

H  N _ H  N  I 

hN  _r 


I  I  ,  Am  I 
M  N _ mN 

hN 


T[H{™+1) 


Am) 


a  nr  - 


(3.7) 
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for  0  <  m  <  N  -  1,  with  Hf  =  H{ 0),  Mf  =  M( 0),  Ip  =  jL  J^1)hN  I(s)ds , 
and  ipm  the  memory  curve  resulting  from  application  of  the  sequence  { //(/  }™  L.  As 
discussed  in  Chapter  2,  we  tacitly  understand  that  the  input  of  h  is  changed  mono- 
tonically  from  Hp  to  Hp+1\  From  the  continuity  and  the  piecewise  monotonicity 
properties  of  (Theorem  2.1.3),  (3.7)  admits  a  unique  solution  for  Hp+l'1 

and  thus  for  Mp+l  ) .  Furthermore,  by  the  piecewise  monotonicity,  Hp+l'>  —  Hp 
and  Mp+l)  —  Mp  have  the  same  sign,  from  which  we  have 


H(m+l)  _  H(m) 
/?  V 


< 

< 

cid/r1 

rrM 

nN 


Co 

I H 


M , 


CO 


(3.8) 


Since  /(•)  is  piecewise  continuous,  we  have  Vm,  Ip  <  Cj ,  with  Cj  >  0  indepen¬ 
dent  of  N .  From  (3.8),  we  can  get 


I H 


(m)i 


N 


<  (1  + 


Co  A' 


N 


(\Hm+coCr)-c0Cr 


<  e~(\H(0)\  +  c0Ci)  -c0Cr=:C, 


(3-9) 


for  all  m,  and  C  is  independent  of  N .  Boundedness  of  Mp  is  a  natural  conse¬ 
quence  of  (3.9). 

We  obtain  Hn(-),  Mn(-)  G  C'([0,T])  by  linearly  interpolating  {Hp}  and 
{ Mp i.e. ,  =  tHP  +  (1  —  r)Hp+1\  for  t,  =  ( m  +  r)/ijv,  0  <  r  <  1, 

and  analogously  for  M,v(-).  Combining  (3.8)  and  (3.9)  we  see  that  Hp)  is  Lip- 
schitz  continuous  with  Lipschitz  constant  L  —  C\{Cj  +  A)  and  the  same  is  true 
for  Mjv(-)-  Therefore  {Hn(‘)}n>i  is  an  equicontinuous  and  equibounded  family 
of  functions,  and  by  Ascoli-Arzela  Theorem,  by  extracting  a  subsequence  if  neces¬ 
sary,  Hn(-)  — >  H(-)  G  C'([0,T])  uniformly  as  N  — »  oo.  It’s  easy  to  see  that  H(-) 
is  also  Lipschitz  continuous  and  thus  differentiable  almost  everywhere.  Similarly 
Mjv(-)  — »■  M(-)  G  C({0,T})  uniformly. 
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Now  define  e,v(f)  =  HN ( t ) +MN (t)—Ci(I(t)  —  H ^ )  at  t  where  Hpj(t )  and  ATvfd) 
exist.  By  the  definitions  of  //,v(-)  and  Mjv(-),  we  know  e,v(t)  is  well  defined  a.e. 
and  ejv(i)  =  Ci(I{^l)  —  I (t))  —  ci('Hn  ^^(0)  ,  for  t  G  (mfiN,  (ra  +  1)Iin)-  Integrating 

+  V/ \  ( / )  =  Ci(I(t) - — - )  +  ejv(t) 

Co 

from  0  to  t,  and  letting  AT  — >■  oo,  one  can  show  #(•)  and  M(-)  satisfy  the  first  part 
of  (3.6)  and  we  are  left  to  show  M (t)  =  r[i7(-),^0](t),  Vt  G  [0 ,T\. 

Let  M/v  =  r[JTv(-)>  ^o]-  By  the  strong  continuity  of  T,  MN  — >  T[/f(-),  ?/;o]  since 
iLjv(')  #(•)•  Furthermore  we  have  M^(rnh-^)  =  M^irnh^),  0  <  m  <  N.  This 
together  with  the  piecewise  monotonicity  of  T  enables  us  to  conclude 

sup  |A/,v(i)  —  Mjv(t)|  <  LIin- 
te[o.T] 

Therefore  as  N  — ►  oo,  {Mjv}  and  {A/,v}  have  the  same  limit,  i.e., 


M(t)  =  r[H(-U’O](t),Vt£[0,T\. 


2.  We  now  prove  the  uniqueness.  By  contradiction  we  assume  that  there  exist 
two  solutions  A /,  ( - ) }  and  {H2(-),  A/2(*)}  and  Hi(t')  ^  H2(t')  for  some  t'  >  0 

(we  know  fTi(O)  =  H2( 0)).  Define  e#  =  H2  —  H\  and  6m  =  M2  —  Mi.  Using  (3.2), 
we  get 

c / / ( / )  [  eH(s)ds.  (3.10) 

Co  Jo 

Define  t  to  be 

t  =  sup{J  :  e#(r)  =  0 ,Vr  G  [0 ,  t] } . 

t<t' 

By  the  continuity  of  e#,  there  exists  5  >  0  such  that  e#(f)  has  a  constant  sign, 
say,  >  0  (without  loss  of  generality),  on  (t,,i  +  6}.  Using  the  order  preservation 
property  of  T  (Theorem  2.1.3),  eM(t)  >  0,  Vi  G  [t,  t  +  8}.  This  together  with  (3.10) 
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leads  to 


\en(t)\  <  —  [  \eH(s)\ds,\/t  G  [0,t  +  5],  (3.11) 

co  Jo 

which  implies  |e^(t)|  <  0  by  the  Gronwall  inequality,  Vt  G  [0,f +  £],  and  this 
contradicts  |e#(t)|  >  0 ,Vt  £  (i,t  +  5\.  u 

Remark  3.2.3  With  minor  modification ,  the  above  proof  can  be  used  to  show 
existence  and  uniqueness  of  solutions  to  more  general  systems  where  the  right  hand 
side  of  the  first  equation  in  (3.2)  is  replaced  by  some  function  f(H.I)  continuous 
in  I  and  Lipschitz  continuous  in  H . 


3.2.2  Continuous  dependence  on  parameters 

Continuous  dependence  of  the  solution  to  (3.2)  on  the  parameters  and  the  initial 
condition  can  be  proved  using  the  properties  of  the  Preisach  operator  and  analysis 
techniques  for  ODEs  [24]. 

Before  we  go  to  the  main  result  of  this  subsection,  we  first  look  at  some  lemmas 
about  the  Preisach  operator.  In  this  subsection  we  deal  with  nonsingular  Preisach 
measures  exclusively.  Since  we  will  discuss  a  sequence  of  Preisach  operators,  a 
Preisach  operator  with  a  density  function  //,  will  be  denoted  as  P^. 


Lemma  3.2.4  If  {//„.}  and  //,  satisfy 


a)  —  //,(/3,  a)  |  dfida  — »■  0, 


then  Vp’o  G  W,  Vu  G  (7([0,  T}),  r#/„  [u, -ipo]  — >  T^u^o]  uniformly  on  [0,  T). 

Proof  For  any  t  G  [0,T],  for  any  n,  P//n  and  If  have  the  same  memory  curve. 
Therefore 


]//„(  o',  a)  —  pi  ).  a)\d,Qda, 
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and  the  conclusion  follows. 


Recall  the  definition  of  (Subsection  2.1.1). 

Lemma  3.2.5  Let  /i  be  a  density  function.  Let  { ifn0  G  T}  be  a  sequence  of  mem¬ 
ory  curves  such  that  f  Jp  ju((3,  a)|^n0(/3,  n)  (ril(  L  a)\df3da  — >  0  for  some  ip o  G  'h. 
Let  the  sequence  un  G  C{{ 0,T])  — »  u  G  C/fO,!1])  uniformly.  Then 

T^lunfifno]  Vp[u,ifo]  uniformly  on  [0,T]. 


Proof  This  follows  from  Theorem  IV. 3. 4  in  [86],  page  114.  ■ 

Lemma  3.2.6  Let  {fin}  be  a  sequence  of  density  functions  such  that  f  fp  \nn(/3,  a)  — 
n(P,a)\ d/3da  — ►  0  /or  some  Borel  measurable  function  fi.  Let  {ifn o  G  T}  6e  a  se¬ 
quence  of  memory  curves  such  that 

I  j  H(S,n)\Q.,J.i.n)  -  fv,o(0,  a) \dpda  -»■  0, 

for  some  if 0  G  V .  Let  the  sequence  un  G  C7( [0, T1] )  — ►  u  G  C'([0,T])  uniformly. 
Then  TPn[un,ifn0\  ->  rM[u,t/0]  uniformly  on  [0,T]. 

Proof  We  get  this  by  combining  Lemma  3.2.4  and  Lemma  3.2.5.  ■ 

Remark  3.2.7  Theorem  IV. 2. 5  in  [86]  shows  continuous  dependence  ofTp[u,ifo] 
on  Hi  ui  Vo.  and  there  convergence  of  the  Preisach  measure  in  the  weak*  sense  is 
assumed,  which,  enables  one  to  conclude  only  pointwi.se  convergence  for  the  output. 
We  use  a  stronger  assumption  fin  I1  in  the  L L  norm  in  Lemma  3.2.6  and  the 
payoff  is  that  we  get  the  uniform  convergence  for  the  output. 
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Theorem  3.2.8  Let  {H(-),  M(-)}  be  the  solution  to  (3.2)  with  the  initial  condition 
•0o  €  VI/.  Assume  that  the  Preisach  measure  u  for  T  in  (3.2)  is  nonnegative  and 
nonsingular ,  and  let  //  be  the  corresponding  density  function.  Consider  a  sequence 
of  equations: 


Hn  +  Mn  =  cnl(In  -  g£) 
Mn  —  I  f,tl  [Hn,  ifno] 


(3-12) 


where  {pn}  is  a  sequence  of  nonnegative  density  functions,  { In  E  P(7([0,T])}  is 
a  sequence  of  piecewise  continuous  functions.  From  if o  and  if n o,  u/e  get  H(0)  and 
Hn(0),  respectively.  If  the  following  assumptions  are  satisfied: 


cni  — *  c%,  i  =  0, 1,  (3.13) 

In  — ►  I  uniformly  on[0,T\,  (3-14) 

#„(0)  -  tf(0),  (3.15) 

I  j  \nn(0,  a)  —  a)\d/3da  — ►  0,  (3.16) 

y  j  a)ICv-no(/^  °)  -  <v«oO0»  ->  0,  (3.17) 

then  {Hn(-),  Mn(-)}  — ►  {//(•),  A/(-)}  uniformly  on  [0,7’]. 


Proof  From  Theorem  3.2.2,  (3.12)  has  a  unique  solution 


{ff„(.),A/„(.)}eC([0,r])xC([0,T]). 


Let  E  C  R2  be  a  compact  set  containing  in  its  interior  the  graph  of  H  =  H(t)  for 
0  <  t  <  T.  Let  Q  >  0  be  such  that 

|ci(/(s)  -  — )|  <  Q,  V(s,H)eE. 

Co 

From  (3.13)  and  (3.14),  there  exists  hi,  such  that  when  n  >  hi  (by  increasing  Q  if 
necessary) 

\Cnl(In(s)-—  )|  <Q,  V(s,H)eE. 
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Choose  5  >  0  such  that  the  rectangle 


R  =  {(s,  H)  :  0<s<6,\H-  H{ 0)|  <  2QS}  C  E. 

By  (3.15),  there  exists  n 2,  such  that  when  n  >  n2,  \Hn(0)  —  Lf(0)|  <  Q5  and  hence 
(0,  Hn(0))  €  R.  Now  from  the  piecewise  monotonicity  of  L/in,  HnMn  >  0,  which 
implies  \Hn\  <  Q.  Therefore  (t,  Hn(t ))  G  /?.  for  0  <  t  <  5. 

{Hn(-)}  is  a  sequence  of  equicont-inuous  (with  same  Lipschitz  constant.  Q)  and 
equibounded  functions  on  [0,(5],  From  the  Ascoli-Arzela  Theorem,  by  extracting  a 
subsequence  if  necessary,  Hn  C([0,5])  uniformly.  Using  Lemma  3.2.6, 

Mn  — >•  M  =  T^H^o]  uniformly  on  [0,(5]. 

It’s  easy  to  show  that  {H(-),  A/(-)}  solves  (3.2)  on  [0,<5].  By  the  uniqueness  of  the 
solution  to  (3.2),  we  have  {£[{■),  =  {H{-),M(-)}  on  [0,  A] .  Hence  we  have 

shown  any  subsequence  of  (and  thus  the  whole  sequence)  {Hn(-),  Mn(-)}  converges 
to  {H(-),  M(*)}  uniformly  on  [0,  <5].  Following  a  standard  argument  for  ODEs  (see, 
e.g.,  [24]),  the  region  of  uniform  convergence  can  be  extended  to  [0 ,T].  ■ 

3.3  A  New  Perspective  to  Study  the  Model 

In  this  section,  we  look  at  (3.2)  from  a  different  perspective.  This  will  lead  to  an  al¬ 
ternative  proof  for  the  well-posedness  as  well  as  provide  insight  into  understanding 
of  various  properties  of  the  model. 

We  define  an  operator  B  :  C([0,T})  x  T  — ►  C([0,T}),  such  that 

B  //.(.■„  (/)  =  H(t)  +  L  //.r„j(7).  ViL  G  C({0,T}),  VUo  G  T,  (3.18) 

where  T  is  the  Preisach  operator.  Let  B  =  B[H,ip0]-  Recall  B  =  +  M) 


55 


(Section  3.1),  hence  the  physical  interpretation  of  B  is  the  scaled  magnetic  flux 
density. 


If  for  any  B  £  C([0,T\)  and  any  ip0  £  \P,  there  exists  a  unique  H  £  (7([0,T]), 
satisfying  B  =  B[H,-ip0\,  then  in  terms  of  B,  (3.2)  can  be  written  as: 


B 


Co 


(3.19) 


where  B  1  denotes  the  inverse  operator  of  B.  Eq.  (3.19)  is  of  a  more  amenable 
form  and  people  have  studied  such  systems,  see  [20]  and  the  references  therein. 
For  an  interval  we  define 


Cj([0,T})  =  {u  £  C([0,T\)  :  u(t)  £  jyt  £  [0,T]}. 


Let  Jh  =  [ Hmin,Hmax ]  C  M  be  the  range  of  H .  Then  the  Preisach  operator 
r  .  C jyj  ( [0 ,  ] )  x  T  *  CjM  ( [0 ,  T j ) ,  w  hei  e  Jm  —  [ •>  -M-max ]  and  Mmin  (Mmax  i 
resp.)  is  the  negative  (positive,  resp.)  saturation  corresponding  to  Hmin  (H  max ) 
resp.) 


Proposition  3.3.1  Let  the  Preisach  measure  of  F  be  nonsingular  and  nonnega¬ 
tive.  Then  for  any  by  G  db  the  mapping 

B[-,*]:C*([0,T|)-*C'*([0,r|) 


is  piecewise  monotone ,  continuous  and  injective ,  where 


Jb  \Lhjnin  “F  1  h/niti  ■  Hmax  Mn 


Proof  It’s  obvious  that  the  range  of  B  C  CjB({ 0,  T])  and  B  is  piecewise  monotone. 
Continuity  of  B[-,if 0]  follows  from  that  of  1  ' • , v. \» ] .  To  show  B\-,ipo\  is  injective, 
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consider  Hi,  H2  £  Cjh([0,T])  and  H\  (t)  ^  H2(t)  for  some  t  £  (0 ,  T).  We  can 
find  //,  0  <  /;'  <  t,  and  <5  >  0,  such  that  // 1  (t)  =  H2(t),Vt  £  [0,  //]  and  (without 
loss  of  generality)  H\(t)  —  H2(t)  >  0,  Vi  £  (t',tr  +  6}.  From  the  order  preservation 
property  of  T,  we  have  B[Hi,if)0](t)  >  B[H2,'ipo](t),  Vi  €  +  5],  which  proves 

the  claim.  ■ 

We  can  also  show  B  is  surjective.  We  first  present  a  lemma  which  will  be  used 
in  the  sequel. 

Lemma  3.3.2  [86]  Let  X,Y  be  metric  spaces,  f  \  X  —>  Y  be  continuous  and 
Y  C  f(X)  be  dense  in  Y .  Also  assume  that  for  any  relatively  compact  set  I\  C  Y , 
the  set  f~l{K)  =  {.x*  £  X  :  f(x)  £  A'}  is  relatively  compact.  Then  f(X)  =  Y .  If 
moreover  f  is  injective,  then  f~l  :  1'  — ►  X  is  continuous. 

Proof  For  any  y  £  Y ,  we  can  find  a  sequence  {yn  £  1'}  convergent  to  y.  Then 
for  any  choice  of  xn  £  f~l(y»),  the  sequence  {xn}  is  relatively  compact,  hence 
xn>  —*■  x  for  some  subsequence  {xn>}  and  some  x  £  X.  Since  /  is  continuous,  we 
have  f(x)  =  y  and  therefore  /  is  surjective. 

Let  now  /  be  injective.  By  the  same  argument  as  above,  if  yn  — >  y  £  Y,  we 
get  f~l(yn )  — >  x  =  f~l{y)  since  x  is  unique  and  independent  of  the  choice  of  the 
subsequence.  ■ 

For  u  £  C([0,  T]),  we  define 

oscu  =  max  u  —  min  u,  V  \t,  \ ,  t2]  C  [0,  71. 

[n,fe]  [ti,t  2]  [0x2] 

Lemma  3.3.3  Let  the  assumption  in  Proposition  3.3.1  hold.  Then  YAo  € 

osc  H  <  osc  B\H ,  F’o],  V  H  £  C([0, 71),  V  [A,  t2]  C  [0,  Tl.  (3.20) 

[tut2]  [tut2]  '  ' 
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Proof  Let 


t*  =  are  max  H,  i*  =  are  min  H. 

[ti,t2]  '  [tiM 

It’s  easy  to  verify  that  T[H,if)0](t*)  >  T{H,  ifo](t*)-  Hence 

oscB[H,ip o]  >  B[H,ip0\(t*)  -  B[H,  ip0}(Q  >  oscH. 
[tlM  [*1.*2] 


Proposition  3.3.4  Let  the  assumption  in  Proposition  3.3.1  hold.  Then  for  any 
ip o  G  T,  B[-,ip 0]  :  Cjh([0,T ])  — >  CjB([0,  T])  is  surjective,  and  its  inverse 

(r'M'o] :  CjB([o,r])  —  c*([o,r]) 


is  continuous. 

Proof  The  results  will  follow  from  Lemma  3.3.2,  by  letting  A"  =  CjH([ 0,T]), 
Y  =  Cjb([0,T]),  f  =  Bj,ij)o\,  and 

Y  =  Cpm,jB([0,T])  =  {u  G  CJb([0,T])  :  u  is  piecewise  monotone}. 

We  now  verify  that  the  assumptions  in  Lemma  3.3.2  are  satisfied. 

From  Proposition  3.3.1,  /  is  continuous  and  injective.  Y  is  obviously  dense  in  Y . 
To  show  Y  C  f(X ),  we  adopt  a  technique  used  in  [85].  Given  ip0  G  T,  we  evaluate 
H( 0)  and  M(0).  Without  loss  of  generality,  we  assume  B  G  Cprn  jB{{().  71])  has 
only  one  monotonicity  partition  with  B( 0)  <  B(T )  (The  argument  extends  to  the 
case  of  multiple  partition  regions  easily).  We  assume  the  compatibility  condition 
is  satisfied2,  i.e.,  B( 0)  =  H(0)  +  M( 0).  We  want  to  find  H  G  CpmjH([0,T}),  such 

2If  this  condition  fails,  we  will  have  to  “blow  up”  the  point  of  input  discontinuity  into  an 
interval  with  nonzero  length  and  make  the  input  continuous  by  linear  interpolation,  see  [55], 
page  55,  or  [20],  page  51. 


58 


that  B[H,fi o]  =  B.  Let  H(t)  =  H( 0)  + t.  By  the  (strict)  piecewise  monotonicity 
of  B,  there  exists  f,  such  that  B  II.  r„|i  /'!  =  B(T).  Let  Hfit)  =  H( 0)  +  ft, 
and  B\  =  B{II] /tpo).  Then  5i  is  strictly  monotone  increasing,  5[(0)  =  5(0)  and 
Bi(T)  —  B(T).  Now  we  introduce  a  time  transformation  d> :  [0,T]  — >  [0,T]  so  that 
(pit)  —  Bp1  o  B{t).  It’s  easy  to  see  H  =  H\  o  d>  will  yield  B. 

We  are  left  to  show  f~l(K)  is  relatively  compact  for  any  relatively  compact 
set  K  C  Y.  Using  Lemma  3.3.3,  the  set  B~X[K, ibo\  is  equicontinuous  if  I\  C 
Cprn.jB  ( [0 ,  T])  is.  Then  we  conclude  with  the  Ascoli-Arzela  Theorem.  ■ 

Remark  3.3.5  Properties  of  B  and  B~'  we  have  shown  so  far  and  the  Lipschitz 
continuity  of  B~[  (to  be  shown  next)  parallel  those  ofT  and  T-1  when  the  Preisach 
measure  u  satisfies  an  extra  assumption: 

^(A(Al,  A2))  >  0,  V  [A i ,  Ao]  C  Jhi  Ai  <  A2,  (3.21) 

where  A(Ai,  A2)  =  {(/?,a)  €  P  :  Aj  <  !:>  <  0  <  A2}  [21,  86,  20].  This  is  not 
surprising  since  the  operator  B  can  be  regarded  as  a  Preisach  operator  T'  with 
measure  u'  =  u  +  uq,  where  u0  is  a  strictly  positive  singular  measure  concentrated 
on  the  line  a  =  /3. 

Proposition  3.3.6  Let  the  assumption  in  Proposition  3.3.1  hold.  Then  the  oper¬ 
ator  £>[•,•]  is  causal  rate-independent,  and  order  preserving. 

Proof  Straightforward.  ■ 

Theorem  3.3.7  Let  the  assumption  in  Proposition  3.3.1  hold.  Then  V'  L’o  €.  T, 

1  [• ,  L’o]  is  Lipschitz  continuous  with  Lipschitz  constant  2,  i.e., 

sup  | Hfit)  -  H2(t)\  <  2  sup  \B\IL.cfiU)  -  B{H2,xf0](t)\,  (3.22) 

te[o,T)  te[o,r] 

for  HuH2eCjH([0,T}). 
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Proof  Essential  ideas  of  the  proof  are  borrowed  from  [21],  where  the  inverse 
of  the  Preisach  operator  I  is  shown  to  be  Lipschitz  continuous  if  (3.21)  and  the 
following  condition  are  satisfied: 

x(x)  >  Cx,  yx  >  0,  (3.23) 

for  some  constant  C  >  0,  where 

X(x)  =  min{i/(A(A,  A  +  x))  :  Hmin  <  A  <  Hmax  -  x}. 

It  suffices  to  show  that,  Vi  G  [0,T], 

-  //,(/)!  <  2  sup  \B[HuW(t)  ~  B{H2,Wo\(t)\.  (3.24) 

re  [or] 

If  H[(t)  =  H2(t),  the  claim  is  trivial.  Assume  Hi(t)  <  H2(t)  (the  case  Hi(t)  > 
Ho(t )  is  similar).  Let  the  corresponding  memory  curves  at  t  be  and  ij)2 [t] 
in  (r, s)  coordinates.  If  ipi[t](r)  <  o_>  /  (r).  Vr  >  0,  (3.24)  is  obviously  true.  So 
we  consider  only  the  case  that  t’i[/](r)  >  '02[/](r),  for  some  r  >  0,  as  shown  in 
Figure  3.4  (same  as  Figure  5.1  in  [21]  with  different  notation). 

Define 

r*  =  inf{r  >  0  :  fi  [/](/•)  >  ^2[t](r)}, 
r*  =  inf{r  >  r*  :  Di[f](r)  =  [i]  (r)}. 

Since  i/>i  [t](0)  <  '02[f](O)  and  we  consider  a  compact  support  for  the  Preisach 

measure,  0  <  r*  <  r*  <  oo.  As  illustrated  in  Figure  3.4,  r*  indicates  the  first 

(counting  from  the  left)  bifurcation  point  of  i\)\  [/,]  and  ij)2 [/,]  after  they  first,  intersect, 
and  r*  indicates  the  next  intersection  point  of  the  two  curves.  Define  regions: 

D\  =  {(r,  s)  :  0  <  r  <  r*,  ijj i [t](r)  <  s  <  ^2 [t] (r)}, 

D-2  =  {(r,  s)  :  r.  <  r  <  r*,  ip2[t](r)  <  s  <  Di[t](r)}. 
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s 


Figure  3.4:  Illustration  of  the  proof  of  Theorem  3.3.7. 


Now 


u)(r,  s)dsdr  +  2 


/  u(r,  s)dsdr  +  Uoo, 
Jd2 

(3.25) 


where  z'oo  represents  the  contribution  from  the  region  where  r  >  r*.  Since  u  >  0, 
we  get  from  (3.25) 


H2(t)  -  H^t)  <  B[H2,il>0\(t)  -  K//:.r„]{/)  +  2 


u(r,  s)dsdr  +  u0 


d2 


(3.26) 


Our  next  goal  is  to  show  that  we  can  find  r  €  [0 ,t\,  such  that 


2 


/  u){r ,  s)dsdr  +  u0 Q 

Jd2 


<  B[HuiI>0](t)  -  J3[H'2 ,  t/^o] (t ) j 


(3.27) 


then  (3.26)  and  (3.27)  would  imply  (3.24). 


We  observe  that  both  graphs  of  ip\  [/,]  and  [t]  when  restricted  to  [r*,  r*],  consist 


of  at  least  two  segments  of  different  slopes  and  we  let  m,  be  the  second  segment 
of  i/>,[f]|[r*,  r*]  counting  from  the  left,  where  “|”  denotes  restriction.  Let  U  be  the 
time  when  rn,  is  formed  on  ipi[t].  Let  r  =  max{b|  ,  O}-  Clearly  r  >  0.  We  claim  r 
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satisfies  (3.27).  To  see  this,  let’s  assume,  without  loss  of  generality,  r  =  t\.  Then 

ipi [r]  =  max  {  o  j/|.  </| }. 

tp2 [r]  <  max  { f. ’•_> [t] ,  ip* }  in  [0,  r2] , 

1P2 [r]  —  ip2[t]  in  [r2,oo], 

where  ipi  and  ip*  are  the  straight  lines  with  slope  —1  through  Q\  and  Q*.  Therefore 
T[HuiP0\{t)  >  T[H‘2,  iPo](t  )  and  H\(t )  >  i72(r),  which  implies  (3.27).  ■ 

Corollary  3.3.8  Let  the  assumption  in  Proposition  3.3.1  hold.  Then 

B-‘1.-]:Cjb({0.T])xI’  -+Cjh({0-T\) 

is  Li.psc.hitz  continuous:  Vipi,ip2  G  T,  V  TT) ,  i7>  G  Cjh([0,T]), 

sup  \H[(t)  —  H'2(t)\  <  2  sup  \B[HlMt)-B[H2,ih](t)\  +  2\\ip1-fo  ||,  (3.28) 
fe[o,T]  te[o.T] 

where 

II  A  ~  hp2  ||=  J  J  1C v-i  (/*><*)  -  (v2{P,a)\p(P,a)dpda, 

and  Cv>  is  as  defined  in  Subsection  2.1.1. 

Proof  The  proof  is  almost  identical  to  that  of  Theorem  3.3.7,  except  that  when 
ipi  ip2,  t  =  max{7|  ,  f2}  may  be  0.  But  if  that’s  the  case,  we  immediately  have 


2  /  /  u:(r,  s)d.sdr  +  z'oo  <  2  ||  ipi  —  ip2  ||, 

J  Jd-2 


(3.29) 


which  completes  the  proof. 


Now  we  can  provide  another  proof  for  Theorem  3.2.2  (see  also  Theorem  3.1.1 
in  [20],  page  124): 

Proof  Define  B  as  in  (3.18)  and  rewrite  (3.2)  as  (3.19).  The  latter  is  equivalent 


B(t)  =  B(0)  +  /‘dCM  -  BliB’WoM)da  =:  T[B](t),  (3.30) 

J  0  Co 
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where  B( 0)  =  H( 0)  +  M(0).  From  Theorem  3.3.7,  when  r  is  small  enough,  the 
operator  T  is  a  contraction  mapping  on  a  closed  subset  of  C([0,  r]).  Therefore 
(3.30)  has  a  unique  solution  B  defined  on  [0,  r]  by  the  contraction  mapping  theorem 
(Appendix  A).  Furthermore,  the  solution  can  be  extended  to  the  interval  [0,T]. 
One  can  then  obtain  H  =  B~l  [B,ip o]  and  M  —  T[H,ip0].  m 

From  Corollary  3.3.8,  we  can  obtain  an  explicit  formula  for  the  continuous 
dependence  of  the  solution  to  (3.2)  on  the  initial  condition: 

Proposition  3.3.9  Let  the  Preisach  measure  u  be  nonsingular  and  nonnegative. 
Led  ur  be  the  modulus  of  continuity  for  T.  For  i  =  1,2,  denote  by  {Hj(-),  Mj(-)} 
the  solution  to  (3.2)  corresponding  to  the  initial  memory  curve  fy  G  T.  Then  for 
any  T  >  0,  any  /(•)  G  PC([0,T]),  we  have 

II  H\  —  H-2  ||c([o,r])<  2(aoe  co  +  ||  ifx  —  ||)j  (3.31) 

||  Mi  —  M-2  Hero.  /  oil  ~'r(  ||  LI\  —  H2  ||c([o,t]))>  (3.32) 


where 


a0  =  //  (0)  +  MAO)  -  H,( 0)  -  M2(0)|  +  —  11  A 


Co 


and  ||  •  || c([o,t])  denotes  the  sup  norm  on  C([0,T]). 


Proof  Let  B.t{t)  =  Hi(t)  +  A/,(t),  i  =  1,2.  Then  for  i  =  1,2, 


A  B~X\B  'll:- If  s') 

Bift)  =  Bi(0)  +  /  Ci(I(s)  - 


Jo 


Co 


which  gives  rise  to 


B1(t)-B2(t)  =  B1(0)-B2(0)-^  /  (B  {\B..c{\(s)-B  1  |/L.  r.,](.s-);i//,.  (3.33) 

co  Jo 


From  Corollary  3.3.8  and  the  Cornwall  inequality,  we  obtain 


2c-i  T 


B i  -  Bo  ||c([o,T])<  a0e  co  . 


(3.34) 
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Then  (3.31)  follows  from  Corollary  3.3.8  and  (3.32)  follows  from  the  continuity  of 

r.  ■ 


3.4  System-Theoretic  Properties  of  the  Model 


In  this  section  we  study  system-theoretic  properties  associated  with  the  model 
(3.2).  In  particular,  we  look  at  stability  of  equilibria,  input-output  stability,  reach¬ 
ability  and  observability. 


3.4.1  Stability  of  equilibria 


The  state  for  (3.2)  is  the  (infinite-dimensional)  memory  curve  tjj  G  T  since  both 
H  and  M  can  be  derived  from  y h  We  set  the  input  I  m  0  in  (3.2)  and  investigate 
stability  of  the  equilibria  of  the  following  equation: 


I  H  +  M  =  -^H 

)  CO 

\  M  =  T[H,  Co] 


(3.35) 


To  get  the  set  of  equilibria,  we  let  H  =  M  =  0  in  (3.35)  and  obtain  H  =  0. 
Therefore  the  equilibria  set 


T0  =  {w  G  T  :  the  graph  of  intersects  the  line  a  =  ,6  at  (0,0)}. 

In  (r,  s)  coordinates,  T0  =  {ijj  G  T  :  ip(0)  —  0}.  Note  T0  forms  a  continuum  and 
any  ip  G  T  whose  graph  is  embraced  by  those  of  %Iju  and  tpi  in  Figure  3.5(a)  belongs 
to  T0. 

Stability  of  an  equilibrium  point  is  usually  discussed  in  the  sense  of  Lyapunov 
[54].  An  equilibrium  point  is  stable  if  all  solutions  starting  at  its  nearby  points 
stay  nearby.  It  is  asymptotically  stable  if  it  is  stable  and  all  solutions  starting 
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Figure  3.5:  Stability  of  the  equilibria:  (a)  the  set  'F0;  (b)  evolution  of  -0t  when 
H( 0)  >  0;  (c)  evolution  of  vpt  when  H( 0)  <  0. 

at  nearby  points  tend  to  it  as  time  approaches  infinity.  Different  metrics  can  be 
defined  to  measure  the  distance  between  two  memory  curves  -0 i,  -02  £  \P.  Recall 
the  definition  for  (0,  (Subsection  2.1.1).  We  can  define 

II  0i  -02  ||l,=  j  I '  ICv-i(/^a)  “Cv* (P,a)\dpda,  (3.36) 

and 

II  i'i  -  ih  ||l1iM=  J  J  k  •  {0,  a)  -  C.-2(  h  a)\ a)dpda.  (3.37) 

We  use  the  notation  ||  •  |k,  and  ||  •  ||l,  since  the  metrics  (3.36)  and  (3.37)  are 
defined  in  terms  of  the  L\  norm  where  the  underlying  measures  are  the  Lebesgue 
measure  in  R2  and  the  Preisach  measure  with  density  //,  respectively. 

A  third  metric  uses  the  Hausdorff  distance 

II  d’i  —  d’2  || h=  dn (graph  of  Di,  graph  of  -02 ) ,  (3.38) 


and  we  recall  for  a  metric  space  X  with  metric  d(-,-),  the  Hausdorff  distance 
dH(S[ .  S-j)  between  two  sets  Si,  S-2  C  A"  is  defined 

dniSi,  S2)  =  max{  sup  inf  d(si,  S2),  sup  inf  d(-si ,  32)}- 

Sie5is2e52  '  'S2G52siGS,i 
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Figure  3.6:  Illustration  of  the  definition  jj^-(ip,±):  (a)  definition  of  f  (?/;,+);  (b) 
definition  of  jjg (ip,  -). 

We  now  define  two  quantities,  ^(r.  +)  and  ('0,  — ),  where  ^ (ip,  +)  (jjfi(ip,  — ), 
resp.)  carries  the  interpretation  of  the  derivative  of  M  with  respect  to  H  when  H 
is  being  increased  (decreased,  resp).  For  ip  G  \Eh  Let  (H,H)  be  the  point  where  ip 
intersects  with  the  line  a  =  (3.  Define  (see  Figure  3.6  for  illustration) 

Hi  =  inf {/5  :  (/j,  H)  G  graph  of  ip}, 


and 


sup {o  :  (H,a)  G  graph  of  ip}. 

a 


Note  usually  either  H$  or  Ha  is  equal  to  H. 
We  let 


dM 

Jh 


dM 

1h 


(>•••  -) 


,  A  M+ 

lim  — - - . 

ah->o+  AH 


lim 

AH^O+ 


AM' 

AH 


(3.39) 

(3.40) 


if  the  limits  in  (3.39)  and  (3.40)  exist,  where  A M+  and  AM  represent  contribu- 
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tions  from  the  shaded  areas  in  Figure  3.6(a)  and  (b),  respectively: 


rH+AH 

ra 

A  M+  =  2 

/ 

/  //!  ■;.  n  )(/■  idn  . 

- 

Ih 

JHb 

[H 

pH a 

AM-  =  2 

JH-AH J 

1  n(P ,  a)dadp 

0 

If  n  is  continuous,  (777 (th+)  and  jfpp(if,  — )  are  well  defined  and  given  by 


„,(>•.  ■  )  -2  /  p(3,  H)d/3, 
dH  J  f[0 

(3.41) 

d\i  r11" 

—r—(if,  —)  =  2  /  //(//. n)(/o. 

dH  Jh 

(3.42) 

It  turns  out  that  our  stability  result  is  independent  of  the  metric  we  use. 


Proposition  3.4.1  Assume  that  the  Preisach  measure  is  nonnegative  and  non¬ 
singular,  and  that  the  density  function  is  piecewise  continuous.  Then  each  if  G  'F0 
is  a  stable  but  not  asymptotically  stable  equilibrium  of  (3.35). 


Proof  Consider  if*  G  To.  Denote  ift  the  memory  curve  at  time  t,  >  0  when  the 
system  starts  from  if0  G  'F  at  t  =  0.  We  claim 


ift  -  if 


| <||  if0  —  if*  ||,  Vt  >  0, 


(3.43) 


where  ||  •  ||  is  any  of  the  three  metrics  we  defined  above. 

We  first  show  if  H( 0)  corresponding  to  ifo  is  not  zero,  then  H(t)  — >  0  mono- 
tonically.  By  the  assumptions  of  the  proposition  and  the  implicit  assumption  that 
H  has  a  compact  support,  jjpfift,  sgn{H))  is  well  defined  a.e.,  and 

0  <^(ifnsgn(H))<C,  (3.44) 

for  some  C  >  0.  Then  the  first  equation  in  (3.35)  is  of  the  form 


H 


Cl 

co(l  +  g(t)) 


H, 


(3.45) 
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where  g(t)  =  j[fi(ipu  sgn(H)),  and  0  <  g{t)  <  C.  From  (3.45),  we  can  see  H(t)  — >  0 
monotonically.  This  implies  that  for  ip0  €  4  with  H( 0)  ^  0,  ipt  —■ >  '*Poo  as  illustrated 
in  Figure  3.5  (b)  and  (c).  If  H( 0)  =  0,  i.e. ,  ipo  €  vFo-  then  ipt  =  >Po-  Therefore 
(3.43)  holds  and  any  ip  G  T0  is  stable.  Any  ip  G  xFo  is  not  asymptotically  stable 
due  to  that  XF0  forms  a  continuum.  ■ 

Remark  3.4.2  Although  any  individual  equilibrium  ip  £  \P0  ls  not  asymptotically 
stable ,  'F0  is  “ globally  asymptotically  stable”,  in  the  sense  that ,  starting  from  any 
ipo  €  T,  lim^oo  infv,G^0  ||  ipt  -  ip  ||=  0. 

3.4.2  Input-output  stability 

For  each  ip0  G  T,  (3.2)  defines  a  mapping  from  the  input  /(•)  to  the  output  H(-) 
and  M(-).  Here  we  discuss  the  question  of  finite  gain  stability  for  this  mapping. 

Definition  3.4.3  A  mapping  M  is  finite  gain  C  stable  if  there  exist  7  >  0  and 
bo,  such  that  for  all  u  in  the  input  space , 

II  Mu  ||£<  7  ||  u  Hr  +60,  (3.46) 

where  ||  •  ||£  denotes  the  signal  space  norm.  We  say  M  has  an  C  gain  less  than  or 
equal  to  7.  and  call  b0  the  bias  term. 


We  study  the  finite  gain  stability  and  the  finite  gain  Z,2  stability  of  (3.2). 
Accordingly,  we  define  two  input  spaces 

U0 c  =  {/(•)  :  /(•)  is  piecewise  continuous  and  sup  \I(t)\  <  00}, 

t>o 

A  f°° 

U 2  =  {/(•)  :  /(•)  is  piecewise  continuous  and  /  I2(t)dt  <  00}, 

Jo 
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and  we  write 


/(•)  ||oc=  sup  \I (t) | ,  V/(-)  G  Uoo, 
t>  o 

,  VJ(.)  G  t/2. 

Finite  gain  stability 

Since  the  Preisach  measure  is  assumed  to  be  finite,  \M(t)\  <  Ms,  Vi  >  0  for 
any  input  /(•),  where  Ms  stands  for  the  saturation  magnetization.  Therefore  the 
mapping  from  /(•)  to  M(-)  is  finite  gain  £,<*,  stable  with  7  =  0  and  bo  =  Ma. 

For  the  mapping  from  /(•)  to  H(-),  we  have  the  following: 

Proposition  3.4.4  Let  the  Preisach  measure  be  nonnegative  and  nonsingular. 
Then  V0o  G  T.  VJ(-)  G  C/00,  we  hnn 

II  //(•)  ||oc<  iaax{|//(0)|, c0  ||  /(•)  ||oo}.  (3.47) 

Proof  The  proposition  follows  from  the  observation  that,  due  to  the  piecewise 
monotonicity  of  the  Preisach  operator,  (3.2)  gives 

H  >  0  if  I(t)  > 

H  <  0  if  I(t)  < 

Co 

■ 

Finite  gain  L-2  stability 

The  mapping  from  /(•)  to  M (•)  is  not  L2  stable  in  general.  To  see  this,  consider 
an  example:  Let  the  initial  memory  curve  d’o  G  T  be  such  that  the  corresponding 
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H{ 0)  =  0  (i.e.  Do  E  T0)  and  M( 0)  ^  0,  and  let  1  =  0.  Then  M  =  M( 0)  and  hence 
not  square  integrable  although  ||  /(■)  || 2=  0. 

The  mapping  from  /(•)  to  //(•)  is  finite  gain  L>  stable: 

Proposition  3.4.5  Let  the  Preisach  measure  be  nonnegative  and  nonsingular  with 
a  piecewise  continuous  density  /<.  Then  Vi/y  €  T,  V/(-)  G  U2,  we  have 

S^(-)ll2<7lU(-)ll2+6o,  (3-48) 


where 


7  =  sup  — 


Cl 


w  + 


_£i_ 


co(l+C)  I 


bo  — 


co(l  +  C) 


2Cl 


l^(0)|, 


and  C  >  0  is  the  constant  in  (3.44)- 


Proof  For  each  ip0  E  T,  we  can  rewrite  (3.2)  as 

Cl  tt,. \  ,  c. 


#(f) 


-i/(t) 


(3.49) 


Co(l+.9(f))  ;  !  +  <?(£) 

where  g(f)  is  as  defined  in  the  proof  of  Proposition  3.4.1.  Eq.  (3.49)  is  a  linear 
time- varying  ODE  and  its  solution  is  given  by  [61]:  H{t)  =  H0(t)  +  Hi(t),  where 

H0(t)  =  e~f(>  ‘od+»<mdaH(0),  (3.50) 

Hi(t)=  [  e~f*  =oO +W»d<T — £l - I(t)cIt.  (3.51) 

Jo  1  +  9(r) 

From  (3.44),  one  obtains  ||  H0{-)  ||o<  bo ■  Again  from  (3.44), 


A 


Hi{t)\  <  H2(t)  =  e  «o(W0Cl|J(i)|, 


(3.52) 


where  “<S>”  denotes  the  convolution.  Note  H2  is  just  the  output  of  a  linear  time 


invariant  system 


Gq(s) 


Cl 


s  — I- 


Cl 

co(  1 +C) 
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when  the  input  is  |/(-)|.  Denote  Fourier  transforms  of  H2,  I  and  |/|  by  H2(jco), 
I(ju)  and  I'{ju ),  respectively.  Then 


ll^iOHl  <  ||  //,(-)  Hi}  (3.53) 

—  —  J  \Go(ju)I  (ju)\2du>  (3.54) 

<  (3.55) 

=  f  T  /  J/(j«)|3(Ju*  (3.56) 

=  f  II  /(Ollt  (3.57) 

where  we  have  used  the  ParsevaTs  identity  [39]  in  (3.54),  (3.56)  and  (3.57).  Eq.  (3.48) 
now  follows  by  using  the  triangular  inequality.  ■ 


Remark  3.4.6  The  bound  y  is  just  the  7fox  no?’?/?,  [39]  of  the  system  Gq(s). 

3.4.3  Reachability 

In  this  subsection  and  next  subsection,  we  will  adapt  the  notions  of  reachability 
and  observability  [74]  to  the  Preisach  operator  P  and  the  system  (3.2). 

Let  H  e  C([0,r])  be  the  input  to  the  Preisach  operator  and  let  t[[t]  be  the 
memory  curve  at  time  t.  It’s  easy  to  check  that  ?/>[•]  is  continuous  on  [0,T]  under 
any  of  the  three  metrics  on  4  defined  in  Subsection  3.4.1,  and  we  write 

#]eC([o,r],«). 

Let  E  :  C([0,r])  x  4?  — ,  C([0,  T],  4?)  be  the  evolution  map  of  the  memory  curve 
(c.f.  Section  2.3  for  the  definition  of  Ed  in  the  discrete-time  case),  i.e.,  for  the  input 
H  £  C([0,T])  and  the  initial  memory  curve  o,  £  4?, 

i’[t]  =  e|//.  /.’(*](/ ). 
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Definition  3.4.7  (Reachability  for  the  Preisach  operator)  Let  ipi,tp2  £  ' F- 

We  say  -ip2  is  reachable  from  ipi  if  there  exists  a  finite  T  >  0,  and  H  G  C([0,  T]), 
such  that  'ijj‘2  =  E[H,ipi\(T).  The  state  space  k  is  called  reachable  if  any  state  is 
reachable  from  any  other  state. 

Definition  3.4.8  (Approximate  reachability  for  the  Preisach  operator) 

Let  be  one  of  the  three  metrics  in  k  defined  in  Subsection  3. 4.1.  We  say 

if‘2  G  k  is  approximately  reachable  from  ifi  G  4/  if  for  any  e  >  0,  there  exists 
tfe  G  4/,  such  that  xfe  is  reachable  from  fi\  and  dfiifip&fi  <  e.  The  state  space  4/ 
is  called  approximately  reachable  if  any  state  is  approximately  reachable  from  any 
other  state. 

Remark  3.4.9  The  definition  of  approximate  reachability  above  is  adapted  from 
that  in  [37]. 

Proposition  3.4.10  [37]  For  the  Preisach  operator  F,  the  state  space  4/  is  not 
reachable,  but  approximately  reachable. 

Proof  Denote  kr  C  4/  the  set  of  memory  curves  composed  of  segments  with 
slope  ±1  in  (r,  s)  coordinates.  It's  easy  to  check  that  kr  is  a  dense  subset  of  k. 
and  any  yjr  G  k,.  is  reachable  from  any  fi>  G  k.  ■ 

Now  for  the  system  (3.2),  for  any  input  /  G  PC([0,T})  (the  space  of  piecewise 
continuous  functions),  the  corresponding  trajectory  fi[-}  G  C([0, T],k)  by  Theo¬ 
rem  3.2.2.  Denote  ED  :  PC([0,T})  x  k  — >  (7([0,r],k)  the  state  evolution  map  for 
(3.2). 

Definition  3.4.11  (Reachability  and  approximate  reachability  for  (3.2)) 

Same  as  Definition  3-4-7  and  Definition  3-4-8,  except  we  replace  H  G  C([0,T ])  and 
E  by  I  G  PC([0,T ])  and  ED,  respectively. 
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Proposition  3.4.12  Let  the  Preisach  measure  be  nonnegative  and  nonsingular. 
The  state  space  W  for  (3.2)  is  not  reachable,  but  approximately  reachable. 

Proof  T  is  not  reachable  since  the  state  space  for  the  Preisach  operator  is  not 
reachable.  We  now  show  W  is  approximately  reachable,  i.e.,  given  bi,  b>2  €  4, 
e  >  0,  there  exists  I  G  PC([ 0,T]),  such  that 

E  D[I,il>i](T)=il>r£%,  (3.58) 

with  d(ipr,ip 2)  <  e.  Indeed,  from  the  rate-independence  property  of  the  Preisach 
operator  and  Proposition  3.4.10,  we  can  find  H  £  C'1([0,T])  (the  space  of  continu¬ 
ously  differentiable  functions),  such  that  E [H,  ipi]{T)  =  ipr  £  Tr  with  d(ipr,  ip2)  <  e. 
By  the  hypothesis,  M  =  G  C([0,2’])  and  it  is  a.e.  differentiable.  Then  by 

the  uniqueness  of  the  solution  to  (3.2),  the  input 

r  H  +  M  H 
I  = - +  — 

Cl  Co 

satisfies  (3.58). 

■ 

3.4.4  Observability 

Definition  3.4.13  (Observability  for  the  Preisach  operator)  We  say  Lp\  G 

T  is  distinguishable,  from  b’2  G  T,  if  there  exists  a  finite  T  >  0  and  H  G  C'QO,  T] ) , 
such  that  Mi(t')  ±  M2(t!)  for  some  t'  G  [0,  T],  where  Mt  =  T{H,^i],  i  =  1,2.  The 
Preisach  operator  is  observable  if  any  state  ip  G  T  is  distinguishable  from  any  other 
state. 

Proposition  3.4.14  Recall  the  definition  of  fy,  (Subsection  2.1.1).  Let  the  Preisach 
measure  be  nonnegative,  and  nonsingular  with  density  //.  The  Preisach  operator  is 
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observable  if  and  only  if\/iPi,ip2  G  VL  and  ip i  fi  ip2, 


H(P, «) I4i  03,  a)  -  Cv>2 09, « 


'a  >  0. 


(3.59) 


Proof  If  (3.59)  holds,  straightforward  analysis  on  the  Preisach  plane  shows  any 


state  can  be  distinguished  from  any  other  state.  Conversely,  if  (3.59)  is  violated 
for  some  pair  ipi,  iP 2,  then  one  can  find  ip{  ^  ip'2  both  intersecting  the  line  a  =  (3 
at  the  same  point  and 


//( J.o)|C.-;(d.n  )  -  (ip>2(P,a)\dpda  =  0. 


(3.60) 


It’s  obvious  that  ip\  is  not  distinguishable  from  ip'2-  m 

As  in  Subsection  3.4.2,  we  take  /(■)  as  the  input  and  {//(•),  M(-)}  as  the  output 
of  the  system  (3.2). 


Definition  3.4.15  (Observability  for  (3.2))  Let  ipi,if2  £  XI;.  We  say  xp\  is 

distinguishable  from  ip2 ,  if  there  exists  a  finite  T  >  0  and  I  G  PC([0,T]),  such 
that  H[{t')  H2{t')  or  M2{t')  for  some  t'  G  [0,  T] .  The  system  (3.2)  is 

observable  if  any  state  ip  G  'f  is  distinguishable  from  any  other  state. 


Proposition  3.4.16  Let  the  Preisach  measure  be  nonnegative ,  and  nonsingular 
with  density  /i.  The  system  (3.2)  is  observable  if  and  only  if\/ipi,ip2  G  4/  and 
ipi  7^  ip2, (3.59)  holds. 

Proof  We  first  show  if  (3.59)  holds,  (3.2)  is  observable.  Let’s  consider  ipi,ip2  G  lL 
such  that  H\  (0)  =  H2( 0)  and  Mi(0)  =  M2( 0)  (otherwise  ipi  is  already  distinguished 
from  ip2  by  taking  t'  =  0).  From  (3.59), 

A/i  (0)  —  A/o(0)  =  2  /  /  u>(r,  s)dsdr  —  2  /  /  co(r,  s)dsdr  +  Voo  =  0,  (3.61) 

J  JD 1  J  J  D-2 
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s 


s 


Figure  3.7:  Illustration  of  the  proof  of  Proposition  3.4.16. 

where  D\  and  D%  are  regions  as  illustrated  in  Figure  3.7(a)  and  Uoo  represents 
the  contribution  from  the  region  r  >  r*.  Applying  a  monotonically  increasing  (or 
decreasing,  resp.)  input  I  with  1(0)  >  (/( 0)  <  - ^21,  resp.).  Then 

until  at  some  time  i,  the  segment  rn-2  (m i  ,  resp.)  is  touched  and  the  area  Dy  starts 
to  change  (Figure  3.7  (b)).  This  breaks  the  balance  in  (3.61)  and  we  will  observe 
Mi(t')  ^  M2(T)  for  some  t'  >  t. 

Conversely,  if  (3.59)  is  violated  for  some  Thyth  €  \Eh  Then  as  in  the  proof  of 
Proposition  3.4.14,  we  can  find  ifA  ^  T’2  and  they  satisfy  (3.60)  and  the  correspond¬ 
ing  H(( 0)  =  H!2(0).  Then  we  can  show  (using,  e.g.,  the  Euler  polygon  method), 
for  any  /(•)  G  PC([ 0,  T]),  {H(,  M[}  =  {//(,  M!2}.  ■ 
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3.5  Existence  of  Periodic  Solutions  under  Peri¬ 


odic  Forcing 

We  observe  a  periodic  motion  of  the  actuator  head  when  a  periodic  input  is  applied 
(Figure  3.1).  We  want  to  investigate  whether  the  model  (3.2),  (3.3)  has  this  same 
property.  Eq.  (3.3)  is  a  linear  system  when  we  treat  M2(t)  as  its  input,  and  it’s 
easy  to  show  it  has  an  asymptotically  orbitally  stable  periodic  solution  [54]  when 
M2  is  periodic.  Therefore  we  need  only  study  whether  (3.2)  has  periodic  solutions 
when  the  input  /  is  periodic. 

Brokate  and  Pokrovskii  studied  asymptotic  stability  of  oscillations  in  nonlinear 
ODE  systems  with  small  hysteretic  perturbations  [19],  where  the  hysteresis  non¬ 
linearity  is  required  to  satisfy  certain  contraction  property.  Studies  on  oscillations 
in  systems  with  hysteresis  can  also  be  found  in  [56,  14,  66].  Techniques  from  these 
papers  can  not  be  directly  applied  to  (3.2),  but  the  key  idea  of  finding  a  fixed  point 
of  some  operator  in  these  papers  proves  to  be  useful  in  our  following  result: 

Theorem  3.5.1  Let  the  Preisach  measure  be  nonnegative  and  nonsingular.  Define 
Jj  =  [-^-W  Let  I  €  Cjj{[ 0,  oo))  be  T -periodic,  i.e., 

I(t  +  T)  =  I(t),  Vi  >  0. 

Let  Ed  :  C'([0,  oo))  x$->  C'QO,  oo),  T)  be  the  state  evolution  map  for  (3.2).  Then 
there  exists  Do  €  T,  such  that  ET[/,Do](i  +  T)  =  ED[I,fio\(t),  Vi  >  0. 

Proof  We  will  use  the  Sehauder  fixed  point  theorem  (Appendix  A).  Recall 
Definition  2.1.1  and  Figure  2.4.  It’s  obvious  that  T  is  a  convex  set.  Denote 
Li([0,ro])  the  Banach  space  of  integrable  functions  on  [0,  ro].  First  we  show  T  is  a 


76 


closed  subset  of  Li([0,  r0]),  where  we  borrow  some  ideas  from  the  proof  of  Theorem 
3.3  in  [37]. 

In  (r,  s)  coordinates,  any  ip  G  T  is  a  continuous  function  of  r  on  [0,ro],  and 
thus  ip  G  Li([0,  ro]).  Let  {ipn  G  T}  be  a  sequence  that  converges  to  ip  G  Ll([0,  r0]) 
(in  the  L\  norm).  By  definition  of  'P  {ipn}  is  equicontinuous  and  equibounded. 
Therefore  by  the  Ascoli-Arzela  Theorem,  a  subsequence  ipnk  —*  ip  €  4/  uniformly 
on  [0,ro],  which  implies  {ipnk}  converges  to  ip  in  the  L\  norm.  Therefore  ip  =  ip 
and  T  is  closed. 

Given  ipo  G  T  and  a  T-periodic  I  G  Cjj{[ 0,'Oo)),  we  have 

’ElD[I,ipo\(t)  G  T,  Vt  >  0, 

from  Proposition  3.4.4.  We  then  define  the  map  :  4/  — »■  T  by 

=i)(V’ o)  =  =^d[I,iPo]{T),  \/ip0  G  T.  (3.62) 

The  metric  of  Li([0,  ro])  on  T  is  equivalent  to  the  metric  (3.36),  which  is  further 
equivalent  to  the  metric  (3.37)  under  the  assumptions  on  the  Preisach  measure. 
Hence  E7D  is  continuous  by  Theorem  3.2.8.  Also  EJD  is  a  compact  mapping  since 
T  itself  is  compact.  Therefore  ~?D  has  a  fixed  point  by  Schauder’s  fixed  point 
theorem,  and  this  completes  the  proof.  ■ 

Remark  3.5.2  Theorem  3.5.1  implies  that  the  corresponding  solution  {H(-),  M(-)} 
is  also  periodic. 
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3.5.1  Existence  of  recurrent  solutions 


Pokrovskii  and  his  colleagues  studied  existence  of  so  called  recurrent  oscillations 
in  the  differential-operator  equation  of  the  form  [64,  65]: 


|  x(t)  =  f(t,x(t),z(t)) 
\  z(t)  =  A[x(-),z(ta)](t) 


(3.63) 


where  Vt,  x(t)  G  Rd,  z(t)  typically  represents  the  internal  state  of  some  hysteresis 
operator  and  belongs  to  an  infinite  dimensional  metric  space  Z  with  the  metric 
dzi  and  A[-,  •]  is  the  evolution  map  for  the  state  z.  For  instance,  for  the  case  of 
the  Preisach  operator  r,  Z  =  4/  and  A  =  E. 

Roughly  speaking,  a  function  u(t)  with  —  oo  <  t  <  oo  is  recurrent  if,  given 
e  >  0,  T  >  0,  there  exists  .4  >  0,  such  that  Vr  G  (—00,00),  any  interval  longer 
than  A  contains  cr  such  that  the  function  t  — ►  u(t  +  r )  is  “e-close”  to  the  function 
t  —>  u(t  +  a ),  t  G  [—T,T\.  The  class  of  recurrent  functions  includes  periodic,  quasi- 
periodic,  almost-periodic  functions  and  many  more  [65].  It  was  shown  in  [64,  65] 
that  (3.63)  has  at  least  a  recurrent  solution,  if  f{t,,x,z)  is  recurrent  in  t  and  (3.63) 
lias  a  uniformly  bounded  solution.  One  can  adapt  the  proof  of  this  result  to  get 
the  following:  when  /  is  recurrent,  (3.19)  (and  thus  (3.2))  has  at  least  a  recurrent 
solution. 


Remark  3.5.3  One  should  not  say  the  result  of  existence  of  recurrent  solutions 
is  stronger  or  weaker  than  that  of  existence  of  periodic  solutions  (Theorem,  3.5.1). 
In  some  sense,  the  result  of  existence  of  recurrent  solutions  is  more  general  but  it 
does  not  imply  Theorem  3.5.1. 
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3.6  Numerical  Simulation  of  the  Model 


Numerically  solving  (3.2)  helps  predict  behaviors  of  the  model,  verify  theoretical 
analysis,  and  validate  the  model  by  comparing  the  simulation  result  to  the  exper¬ 
imental  measurement.  It  will  also  prove  useful  in  parameter  identification  for  the 
model. 

3.6.1  Explicit  Euler  algorithm 

The  Euler  polygon  method  was  used  in  establishing  the  well-posedness  of  (3.2)  in 
the  proof  of  Theorem  3.2.2.  Here  we  use  the  Euler  method  to  obtain  an  approx¬ 
imate  solution  to  (3.2).  Given  the  memory  curve  tp{t0)  at  time  t0  and  the  input 
/(•),  approximate  values  of  H  and  A I  at  to  +  h  are  computed  via 

—  ‘  l(  (,)‘“)  ,  (3.64) 

M(to  +  /i)  =  r[^(#o  +  /i),#o]] 

where  h  is  the  time  step  size  (see  the  comments  in  the  proof  of  Theorem  3.2.2  for 
proper  understanding  of  the  notation  in  (3.64)).  We  call  (3.64)  the  explicit  Euler 
scheme  since  H(t0  +  h)  is  not  involved  in  the  right-hand  side  of  the  first,  equation 
in  (3.64),  following  the  terminology  in  the  ODE  literature  [40]. 

As  noted  in  the  proof  of  Theorem  3.2.2,  (3.64)  has  a  unique  solution  if  the 
Preisach  measure  is  nonnegative  and  nonsingular.  Eq.  (3.64)  can  be  solved  by 
adapting  the  inversion  algorithms  for  T  discussed  in  Subsection  2.3.2.  Denote  the 
right-hand  side  of  the  first  equation  in  (3.64)  as  go-  We  consider  the  case  go  >  0 
and  the  other  case  can  be  dealt  with  similarly. 

If  the  Preisach  measure  satisfies  the  assumptions  of  Proposition  2.3.4,  the  fol- 
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lowing  algorithm  can  be  used  to  solve  (3.64): 


H(n+1)  =  H(n)  + 

L+V 

M(n+ 1)  =  r[//(n+1Vdi(n)] 


(3.65) 


^  ](n+ 1)  _  j(n)  _  (]j(n+ 1)  _  fj{A^  _  (A/(n+L)  —  , 

with  v  as  defined  in  (2.20),  /7(0)  -  H(t0),  M(0)  =  M(t0),  /(0)  =  hg0. 

If  the  density  p  is  piecewise  uniform,  obtained  from  the  collection  of  identified 
weighting  masses  as  discussed  in  Subsection  2.3.2,  the  algorithm  (2.22)  can  be 
modified  to  solve  (3.64).  At  iteration  n,  evaluate  d\a\  and  aj,"1  as  in 


Subsection  2.3.2.  Then  solve 


4n)(4n))2  +  (l  +  a(1n)K  =  ^) 

for  where  =  hg0,  and 

l(n+l)  =  j(n)  _  (H(n+l)  _  _  ( M(n+1)  _  M(n)),  Vri  >  0. 


Then  iteration  of  (2.22)  will  yield  the  solution  to  (3.64). 


3.6.2  Accuracy  of  the  Euler  algorithm 

We  have  the  following  result  about  accuracy  of  the  algorithm  (3.64): 

Proposition  3.6.1  Assume  that,  the  Preisach  measure  is  nonnegative,  and  non- 
singular  with  a  piecewise  continuous  density.  Assume  that  the  input  /(•)  is  contin¬ 
uous  and  bounded.  Consider  the  algorithm  (3.64).  Let  the  true  solution  to  (3.2) 
be  {H (•),  Af (•)}.  Assume  ^-(^[to],  ±)  and  the  derivatives  of  H(t)  and  M(t )  at  to 
exist.  Then 

\H(t0  +  h)-H(t0  +  h)\  =  O(h2), 

\M(t0  +  h)  -  M(t0  +  h) |  =  0(K2). 
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(3.66) 

(3.67) 


Proof  Denote  go  the  right-hand  side  of  the  first  equation  in  (3.64).  Taylor  series 
expansion  of  H  and  M  at  to  gives  us 


H(t0  +  h)  =  H{to)  +  H(t0)h  +  0(h 2), 
M(t0  +  h)  -  M(to)  +  M(to)h  +  0(h 2), 


where 


and 


H(t0) 


M(to) 


go 


1  + 

clM 


dH 


(ip[to},sgn(g0))H(to). 


From  (3.64)  and  the  piecewise  monotonicity  property  of  F. 


\H(t0  +  h)  -  H(t0) |  <  h\g0\  <  hC , 


for  some  constant  C  >  0.  From  this  we  have 


(3.68) 

(3.69) 


(3.70) 


M (t0  +  h)  -  A/ (t0) 

d  M  ~  ~ 

—  (D[f0],  sgn(go))(H(to  +  h)  -  H(t0))  +  O(\H(t0  +  h)  -  H(t0) |2) 

d  M  _ 

sgn(go))(H{t.0  +  h)  -  H(t0))  +  0(fi2).  (3.71) 


Combining  (3.64)  and  (3.71),  we  have 


H  (to  +  h ) 


H(t0)  + 


hgo 


1  +  TJT^-'lio}- sgn(go)) 


+  0(h s 


(3.72) 


Then  (3.68)  and  (3.72)  lead  to  the  estimate  (3.66),  while  (3.69),  (3.71)  and  (3.72) 
lead  to  (3.67).  ■ 

We  have  seen  the  local  error  (the  error  in  one  step)  for  the  algorithm  (3.64) 
is  0(h2)  and  thus  the  global  error  is  0(h).  This  is  consistent  with  the  accuracy 
order  of  the  Euler  method  in  numerical  integration  of  usual  ODEs. 
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A  natural  question  to  ask  is  whether  we  can  obtain  algorithms  of  high  order 
accuracy  for  solving  (3.2)  by  properly  adapting  high  order  integration  methods 
for  ODEs,  e.g.,the  mid-point  rule  and  other  Runge-Kutta  methods.  From  our 
preliminary  investigation,  the  answer  appears  to  be  “no” .  One  of  the  difficulties  is 
due  to  the  dependence  of  “yff”  on  the  sign  of  H. 


3.6.3  Implicit  Euler  algorithm 

Implicit  methods  perform  better  than  explicit  ones  for  many  problems,  especially 
for  stiff  problems  [41],  Existence  of  fast  transient  dynamics  in  a  system  is  a  typical 
cause  of  stiffness.  Eq.  (3.2)  is  stiff,  which  one  can  see  easily  after  it  is  rewritten  as 
(3.49).  Therefore  we  propose  an  implicit  Euler  algorithm  to  solve  (3.2): 


(3.73) 


H(to+h)-H(t0)  ,  M(t0+h)-M(t0)  „  ,ru  ,  io  H(to+h)^ 

- h - H - h - -  Ci(l(t0  +  h) - —) 

M(to  +  h)  =  r[H(to  +  h),i'M 

Solving  (3.73)  does  not  require  more  effort  than  solving  (3.64)  since  (3.73)  can 
be  rewritten  as 


f  <i  +  %)m o + />)  -  h(u))  +  =  c,< 7(i0 + ft)  -  sm) 

(  M(t„  +  ft)  =  r[5(t0  +  /»),ift[<o]] 


which  carries  the  same  structure  of  (3.64).  Figure  3.8  compares  the  performance  of 
the  explicit  scheme  (3.64)  and  the  implicit  scheme  (3.73).  Same  input  I  is  applied. 
The  step  size  h  =  8  x  10-5  second.  We  can  see  that  the  implicit  algorithm  is  much 
more  stable  and  it  can  provide  meaningful  solutions  even  if  h  is  not  very  small. 
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Figure  3.8:  Comparison  of  the  implicit  Euler  scheme  with  the  explicit  Euler  scheme. 


(a):  the  input  current;  (b),  (d):  trajectories  of  H,  M  computed  by  the  explicit 
scheme;  (c),  (e):  trajectories  of  //,  M  computed  by  the  implicit  scheme. 
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Figure  3.9:  Displacement  amplitude  vs.  input  frequency. 

3.7  Parameter  Identification 

In  this  section  we  discuss  how  to  identify  parameters  involved  in  the  model  (3.2) 
and  (3.3).  The  Preisaeh  measure  is  identified  as  described  in  Section  2.2.  The 
following  parameters  are  provided  by  the  manufacturer  (some  of  them  presented 
already  in  Section  2.2):  Nm  =  1300.  Am  =  2.83  x  10-5m2,  lrod  =  5.13  x  10_2m, 
Ms  =  7.87  x  105A/m,  co  =  1.54  x  104/m.  The  saturation  magnetostriction  Xa 
is  identified  to  be  0.001313.  To  estimate  the  first  resonant  frequency,  we  apply 
sinusoidal  inputs  of  the  same  amplitude  but  different  frequencies  and  measure  the 
amplitudes  of  the  displacement.  Figure  3.9  displays  the  displacement  amplitudes 
at  different  frequencies  and  we  determine  the  first  resonant  frequency  to  be  392 
Hz. 

We  are  now  left  with  identification  of  Reddy  and  £.  Generally  it’s  impossible 
to  write  down  the  explicit  solution  of  (3.2)  in  terms  of  Reddy,  therefore  we  can 
not  identify  Reddy  directly.  A  theoretical  value  of  Reddy  can  be  computed  with  the 


84 


formula  Reddy  =  ~  [82] ,  where  p  is  the  resistivity  of  the  magnetostrictive 

material,  b  and  a  are  the  outer  and  inner  radii  of  the  magnetostrictive  rod.  We  use 
this  formula  to  obtain  an  upper  bound  R  of  Re(idy  by  letting  a  —  0.  Plugging  in 
p  =  5.8  x  10  7il/u\  for  Terfenol-D,  we  get  R  —  480.212.  We  then  discretize  [0,  R] 
and  denote  the  mesh  points  by  R^dy,  %  =  1,  •  •  •  ,  N.  The  discretization  need  not 
be  uniform  and  we  make  it  finer  in  the  region  where  the  dynamics  of  (3.2)  is  more 
sensitive  to  Re.ddy 

We  observe  a  periodic  motion  of  the  actuator  head  when  a  periodic  input 
is  applied.  We  have  also  shown  (3.2)  and  (3.3)  have  periodic  solutions  if  /(•) 
is  periodic  (Section  3.5)  .  These  observations  motivate  the  following  scheme  to 
identify  Reddy  and  £: 

•  Step  1.  We  apply  a  sinusoidal  current  (with  some  dc  shift  if  necessary)  /(•) 
with  frequency  /  to  the  actuator  and  measure  the  phase  lag  9yj  between  the 
fundamental  frequency  component  of  the  displacement  and  the  current; 

•  Step  2.  For  each  R^idy,  we  numerically  integrate  (3.2)  with  /(•)  as  the 
input,  and  calculate  the  phase  lag  9m2,i  between  the  fundamental  frequency 
component  of  M2(-)  (in  its  steady  state)  and  /(•). 

•  Step  3.  The  difference  9yj  —  9m2j  is  considered  to  be  the  phase  lag  between 
the  fundamental  frequency  component  of  y(-)  and  that  of  M2(-)  in  (3.3), 
from  which  we  can  compute 

Remark  3.7.1  The  idea  of  relating  the  phase  shift  between  the  output  and  the 
input  to  hysteresis  can  also  be  found  in  [28].  We  note  that  in  general ,  the  phase 
lag  depends  highly  nonlinearly  on  the  initial  condition,  and  the  amplitude  and  the 
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frequency  of  I (•),  so  we  should  make  sure  that  the  initial  condition  in  simulation 
is  consistent  with  the  condition  in  the  experiment. 

We  repeat  the  above  experiment  (Step  1  to  Step  3)  K  times  with  different  input 
frequencies  and  denote  the  damping  coefficients  as  for  R^}dv-  If  R^uiy  is 

close  to  the  true  parameter  Reddy,  f{‘'>  should  not  vary  much  with  k.  Therefore  we 
pick  i*  G  {1,  •  •  •  ,  N}  such  that  {£^  '  }jf=]  has  the  minimum  variance,  and  estimate 
Reddy  via  Reddy  =  R^Jy  and  let  £  be  the  mean  of 

Figure  3.10  shows  the  variation  of  £  with  respect  to  frequency  for  different 
Reddy8’  The  parameters  are  determined  to  be  Reddy  =  700,  £  =  0.7783.  Figure  3.11 
compares  the  rate-dependent-  hysteresis  loops  measured  in  experiments  to  those 
obtained  through  simulation  based  on  the  identified  parameters.  We  see  that  the 
simulation  results  agree  with  the  experimental  results  reasonably  well  up  to  200 
Hz.  Since  the  depth  of  eddy  current  penetration  depends  on  the  frequency,  so 
does  Reddy  This  explains  why  the  comparison  in  Figure  3.11  goes  worse  when  the 
frequency  is  beyond  200  Hz.  In  practice,  one  can  identify  Reddy  according  to  the 
operating  frequency  range  of  the  specific  application. 

3.8  An  Inverse  Control  Scheme 

In  this  section  we  propose  an  inverse  control  scheme  for  the  dynamic  hysteresis 
model  (3.2)  and  (3.3).  We  first  formally  describe  the  scheme  to  highlight  the  idea, 
then  we  discuss  how  to  implement  it. 

Given  a  desired  displacement  trajectory  y(-)  G  (72([0,  T]),  we  compute  for  every 
t,  hit)  =  -2f r*  t-  ( y(t )  +2 fuj0y(t)  +  u&y(t))  and  then  let  M(t)  =  \/u{t).  Next  we 
obtain  H(-)  from  M(-)  by  inverting  the  Preisach  operator  T.  We  then  (formally) 
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Figure  3.10:  Identification  of  Reddy  and  £. 
let 

I(t)  =  -(S(t)  +  M(t)  +  ^).  (3.75) 

Cl  Co 

Due  to  the  uniqueness  of  the  solution  to  (3.2)  and  (3.3),  we  expect  the  output  y(-) 
under  /(•)  to  agree  with  y(-). 

All  we  have  just  said  is  the  ideal  case.  Several  issues  need  to  be  taken  care  of 
in  implementing  the  scheme. 

First  of  all,  the  desired  trajectory  y(-)  may  not  be  twice  differentiable.  For  (3.3), 
let  £>([0,T])  be  the  space  of  attainable  y(-)  under  some  control  u(-)  G  C([0,  T]) 
and  0  <  u(t)  <  M;,  \/t  G  [0,T]  (u  plays  the  role  of  M 2  in  (3.3)).  In  general, 
we  need  first  find  y*(-)  G  D([0,T])  which  is  closest  to  y(-)  in  the  sup  norm  (i.e. , 
the  projection  of  y(-)  on  ^([O,!1]))  and  then  work  with  ?/(•).  In  our  experiments 
below,  however,  y  is  picked  from  D([0,T])  since  our  main  objective  is  to  validate 
the  model. 

Since  (3.3)  is  a  linear  system,  sometimes  u(-)  (and  therefore  M(-))  is  available 
as  the  output  of  some  linear  controller.  In  such  cases,  the  inverse  problem  becomes 
how  to  get  /  from  M  and  the  solution  to  it  is  just  (3.75). 
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Figure  3.11:  Model  validation.  Solid  line:  experimental  measurement;  Dashed  line: 
numerical  prediction . 

Another  question  is  that  M(-)  or  H(-)  may  not  be  differentiable.  In  general 
this  should  not  bother  us  because  we  work  in  the  discrete-time  setting  (for  digital 
computer  control)  and  the  derivatives  are  approximated  by  the  finite  difference 
method. 

Three  inverse  control  schemes  have  been  implemented  to  track  a  desired  dis¬ 
placement  trajectory.  The  first  one  is  based  on  the  dynamic  hysteresis  model, 
the  second  one  is  based  on  the  Preisach  model  alone  (c.f.  Section  2.3),  and  the 
third  one  is  based  on  the  non-hysteretic  model  described  in  Subsection  2.4.3.  The 
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sampling  period  used  is  0.05  ms.  Experimental  results  are  shown  in  Figure  3.12  - 


3.14.  In  each  figure,  the  displacement  trajectories  (both  the  desired  and  the  mea¬ 
sured),  the  tracking  error  and  the  input  current  are  displayed.  We  can  see  that  the 
performance  of  the  first  scheme  is  very  satisfactory.  This  shows  that  the  dynamic 


hysteresis  model  can  capture  high  frequency  effects  in  the  actuator,  and  that  our 


identification  and  inverse  control  schemes  are  effective. 


Figure  3.12:  Trajectory  tracking  based  on  the  dynamical  hysteresis  model. 


Time  (see.) 


Figure  3.13:  Trajectory  tracking  based  on  the  Preisach  model  alone. 
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Figure  3.14:  Trajectory  tracking  based  on  the  non-hysteretic  model. 
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Chapter  4 


A  Robust  Control  Framework  for 
Smart  Actuators 


Due  to  the  open  loop  nature  of  inverse  compensation,  its  performance  is  susceptible 
to  model  uncertainties  and  to  errors  introduced  by  the  inverse  schemes.  In  this 
chapter  we  address  this  problem  by  combining  inverse  control  with  robust  control 
techniques.  Appendix  C  provides  the  background  (and  notation)  on  robust  control 
necessary  for  development  of  results  in  this  chapter. 

Figure  4.1  illustrates  the  idea  underlying  the  robust  controller  design  method 
for  smart  actuators.  We  consider  the  discrete-time  setting  in  the  interest  of  digital 
control.  W  and  Ga( A)  represent  the  nonlinear  part  and  the  linear  part  of  the 
actuator  model,  respectively.  W  could  be  a  Preisach  operator  (rate-independent 
hysteresis),  or  a  rate-dependent-  hysteret-ic  operator,  like  (3.2),  together  with  other 
nonlinearities,  e.g.,  the  square  operator  in  Figure  3.2.  We  recall  (see  Appendix  C) 
that  A-transform  G( A)  of  a  LTI  system  G  is  essentially  the  usual  ^-transform  of 
G  with  A  =  z~l.  G'o(A)  denotes  the  plant  we  want  to  control.  An  approximate 
(right)  inverse  IV  1  is  connected  in  series  to  W,  to  approximately  cancel  out  the 
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Figure  4.1:  A  robust  control  framework  for  smart  actuators. 

nonlinearity  W.  Then  a  linear  controller  K{ A)  is  designed  for  the  composite  plant 
Go  Go-  In  addition,  actuator  saturation  is  also  considered  in  Figure  4.1. 

As  an  example,  we  study  the  robust  trajectory  tracking  problem.  The  require¬ 
ments  for  the  controller  can  be  roughly  stated  as:  in  the  presence  of  the  inversion 
error  eu  =  u  —  u  and  the  model  uncertainties  in  Ga  and  Go,  for  all  desired  trajec¬ 
tories  in  a  certain  class, 

•  the  closed-loop  system  is  stable, 

•  the  tracking  error  is  minimized,  and 

•  the  output  of  K  does  not  exceed  the  saturation  limits. 

A  more  precise  formulation  of  the  robust  control  problem  will  be  presented  in 
Section  4.2. 

Remark  4.0.1  We  take  the  saturation  limits  into  account  in  the  design  of  K 
to  ensure  that  the  overall  system  operates  in  the  linear  region  and  thus  predictions 
based  on  the  linear  design  are  credible.  We  will  see ,  however ,  that  strictly  enforcing 
the  saturation  constraint  at  the  stage  of  controller  design  compromises  the  tracking 
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Figure  4.2:  Two  ways  to  represent  the  inversion  error. 

performance.  Further  discussions  on  how  to  incorporate  the  saturation  nonlinearity 
into  controller  design  will  be  provided  in  Section  4-4- 

A  first,  step  toward  the  robust  controller  design  is  to  quantify  the  inversion  error 

€U' 


4.1  Quantification  of  the  Inversion  Error 

In  general  TV-1  is  not  an  exact  inverse  of  IT  (Figure  4.1)  and  two  factors  may 
contribute  to  the  inversion  error:  parameter  uncertainties  and  non-existence  of 
exact  inverse  schemes. 

There  are  two  possible  ways  to  model  eu.  The  first  one  is  to  model  it  as  the 
output  of  some  uncertainty  block  A  (Figure  4.2(a))  and  the  other  one  is  to  simply 
model  it  as  an  exogenous  disturbance  v  (Figure  4.2(b)).  As  we  will  see  shortly,  eu 
is  independent  of  u  and  it  is  possible  that  eu  ^  0  for  u  =  0.  Therefore  there  exists 
no  stable  A  such  that  eu  =  A u,  and  we  will  treat  eu  =  v  as  an  external  noise. 

We  need  specify  the  signal  spaces  for  quantification  of  the  inversion  error.  The 
inversion  error  for  the  Preisach  operator  is  bounded  in  magnitude  instead  of  in 
energy.  Hence  a  natural  choice  for  the  signal  spaces  is  loo  and  not  h-  Also  it  is  more 
appropriate  to  use  l0 0  for  the  desired  trajectory  and  the  tracking  error.  Another 
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Figure  4.3:  The  error  in  inversion  of  the  Preisach  operator. 

advantage  of  using  loo  for  signals  is  that  the  actuator  saturation  constraint  can  be 
easily  handled  in  the  corresponding  lx  robust  control  theory,  while  it’s  very  hard 
to  be  formulated  in  Hoc  control  theory. 

We  first,  quantify  the  error  in  inversion  of  the  Preisach  operator,  and  then  con¬ 
sider  a  new  inversion  scheme  for  the  dynamic  hysteresis  model  (3.2)  and  quantify 
the  error  introduced  by  this  scheme.  In  both  cases  we  are  concerned  with  quan¬ 
tification  of  cm  =  M  —  M,  where  M  and  M  denote  the  trajectories  of  achieved 
magnetization  and  desired  magnetization,  respectively.  Finally  we  indicate  how  to 
obtain  eu  from  cm  when  a  square  nonlinearity  is  included  in  IF. 

4.1.1  Error  in  inversion  of  the  Preisach  operator 

Consider  Figure  4.3,  where  T  is  a  Preisach  operator  with  nonsingular  Preisach 
measure  u. 

Error  due  to  an  inversion  scheme 

Assume  we  are  given  the  Preisach  measure.  Consider  the  inversion  algorithm 
(2.21)  with  the  stopping  criterion  \M^>  —  M\  <  e.  Then  it’s  straightforward  that 
II  cm  ||oo<  (  for  any  M  £  Zoo- 
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Error  due  to  the  parameter  uncertainty 


If  the  Preisach  measure  is  not  given,  we  can  discretize  the  Preisach  plane  and 
identify  a  collection  of  weighting  masses,  as  discussed  in  Subsection  2.2.2.  We  can 
then  obtain  a  nonsingular  Preisach  measure  vp  with  a  piecewise  uniform  density 
Up  by  distributing  each  weighting  mass  uniformly  over  the  corresponding  cell  in 
the  discretization  grid.  We  have  presented  an  exact  inversion  scheme  (2.22)  for  the 
Preisach  operator  T  with  measure  up  (Subsection  2.3.2).  For  the  inverse  algorithm 
(2.22),  cm  can  be  attributed  to  the  measure  uncertainty  \up  —  u |.  We  now  quantify 
6m  in  terms  of  the  identification  error  and  the  discretization  level. 


Proposition  4.1.1  Let  the  true  Preisach  measure  u  be  nonnegative,  and  nonsin¬ 
gular  with  density  //.  Assume  p(/3,a)  <  p,  for  any  (/?,«)  in  the  Preisach  plane, 
where  p  >  0  is  a  constant.  Let  H,  =  cto  —  do,  where  [doWo]  is  the  input  range  of 
the  Preisach  operator.  Given  a  discretization  of  level  L.  denote  the  integral  of  p 
over  a  cell  i  (either  square  or  triangular)  as  zy°,  1  <  i  <  Ay,  where  Nc  is  the  total 
number  of  cells.  Consider  the  measure  identification  scheme  in  Subsection  2.2.2 
and  denote  by  i y  the  identified  mass  for  cell  i,  1  <  i  <  Ay.  Assume  that  the  relative 


error  in  identification  is  Sr,  he., 


Pi  ~  W 
if- 


<  Si,  1  <i<  Ay. 


Assume  that  the  initial  memory  curve  do  ^  vh  is  given.  Then  for  any  M  €  l0 Q,  any 


Wo  e  'F, 


eM 


oo <  SjMs  + 


2  pHf 
L 


where  Ms  is  the  positive  saturation  corresponding  to  u. 


(4.1) 


Proof  Define  pp  as  discussed  earlier.  We  obtain  another  Preisach  measure 
with  a  piecewise  constant  density  pp  by  distributing  iff  uniformly  over  the  cell  i, 
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Figure  4.4:  Illustration  of  the  proof  of  Proposition  4.1.1  (L  =  8). 


1  <  i  <  Nc.  To  distinguish  the  Preisach  operators,  we  will  put  the  corresponding 
density  as  the  subscript  of  T,  e.g.,  r#(  means  the  Preisach  operator  with  the  density 

/i. 

Given  M  G  and  d’o?  we  denote  the  output  of  T-1  in  Figure  4.3  as  H.  Then, 
Vfc  >  0, 

\eM[k]\  =  \T,[H,  c-o]M-r/;r,[/Tt/-o][A:]| 

<  -  r,o [h,  v'0][A*]|  +  \r,o[H,Mk]  -  (4-2) 

All  three  Preisach  operators  involved  in  (4.2)  share  the  same  memory  curve 
Wk  >  0.  It’s  obvious  that  the  second  term  of  (4.2)  is  bounded  by  5jMs.  To  bound 
the  first  term,  we  note  that  for  any  k  >  0,  the  memory  curve  t/j[k]  spans  L  —  1 
square  cells  and  one  triangular  cell  (Figure  4.4).  Any  cell  not  touched  by  t[)[k]  will 
contribute  the  same  amount  to  Tfl[H,'ipo][k]  and  T^o [H, ipo] [k] .  Hence  the  first  term 
of  (4.2)  is  bounded  by  twice  the  interal  of  //.  over  cells  spanned  by  ij)[k],  which  is 
further  bounded  by 

2////;J(b  -4)  2 fiH2r 

L2  <  L 

This  completes  the  proof.  ■ 

Remark  4.1.2  From  Proposition  4-1-1  >  the  bound  on  the  inversion  error  consists 
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of  two  parts:  the  first  part  is  proportional  to  the  relative  identification  error,  and 
the  second  part  is  inversely  proportional  to  the  level  L  of  discretization. 

Remark  4.1.3  The  assumption  that  i/jq  is  known  is  not  very  restrictive  since  in 
many  cases  we  have  the  choice  to  initialize  the  system.  On  the  other  hand,  if  Wq 
is  not  known  exactly,  we  can  easily  include  a  term  in  ||  <  m  H^.  which  takes  care  of 
the  uncertainty  in  i.p. 


4.1.2  Error  in  inversion  of  the  dynamic  hysteresis  model 


Given  a  desired  trajectory  of  magnetization,  we  proposed  an  inverse  control  scheme 
(3.75)  for  the  model  (3.2)  in  Section  3.8.  But  if  there  is  uncertainty  in  the  model 
parameters,  it  is  very  hard  (if  not  impossible)  to  derive  a  bound  for  the  inversion 
error.  Here  we  will  present  another  inversion  algorithm.  This  algorithm  leads  to 
an  inversion  error  even  if  the  exact  parameters  are  known,  but  it  will  allow  us  to 
quantify  the  inversion  error  when  model  uncertainty  is  considered. 

One  way  to  rewrite  (3.2)  is: 


*(*)  =  t*tW*)  -  ^)  ^  (4S) 

where  g(t)  =  sgn(H(t)))  and  0  <  g(t)  <  C  (c.f.(3.45)).  We  can  treat  the 

first,  equation  in  (4.3)  as  a  linear  time-varying  ODE  of  H ,  and  regard  (4.3)  as 
perturbed  from  the  following  decoupled  system: 


wt)  =  Ttj(m  -  ^) 
M(t )  =  r[/f,ifo](t) 


(4.4) 


where  g  E  [0,  C\  is  some  constant.  Based  on  (4.4),  an  approximate  inversion  scheme 
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for  (4.3)  is  given  formally  by 


1  H(t)  =  r-'[A/.Wo]«) 

{  Kt)  =  i ±lH(t)  +  f! 


(4.5) 


When  implementing  (4.5)  in  the  discrete  time,  we  have  two  ways  of  writing 
/[•],  which  correspond  to  the  explicit  Euler  scheme  and  the  implicit  Euler  scheme 
in  discretizing  the  first  equation  in  (4.4),  respectively:  for  k  >  0, 


1  +  9-(H[k]  -  H[k  -  1})  +  ^ 


Ci  h 


Co 


1  +  9 

Ci  h 


(H[k\-H[k-  1])  + 


Co 


(4.6) 

(4.7) 


where  h  is  the  time  step  size,  H{—  1]  =  H[ 0  . 


Remark  4.1.4  Direct  discretization  of  the  first  equation  in  (4-4)  by  the  explicit 
Euler  scheme  (a  similar  remark  applies  to  the  implicit  Euler  scheme )  gives: 

I{k]  =l-±l(H[k  +  l\  -H[k\)  + 

(  I I  I  Co 

but  this  is  not  a  causal  system  and  thus  not  realizable.  An  intrinsic  delay  is  intro¬ 
duced  in  the  inversion  due  to  the  dynamics  in  the  rate- dependent  hysteresis  model. 


We  now  want  to  study  the  errors  caused  exclusively  by  the  inversion  algorithms, 

i.e.,  we  assume  that  we  have  exact  values  of  parameters.  For  the  inversion  algorithm 

(4.6),  the  discrete  time  version  of  the  first  equation  in  (4.3)  is  obtained  by  the 

explicit  Euler  scheme: 

H[k  +  1]  -  H[k]  _  ci 

h  1  +  g[k] 

where  g[k]  =  g(k.h).  Similarly,  if  the  inversion  algorithm  (4.7)  is  used,  we  will 
use  the  corresponding  discrete-time  model  obtained  by  the  implicit  Euler  scheme. 
Figure  4.5(a)  shows  the  problem  setup  for  the  explicit  Euler  case. 


m  -  — ). 

Co 


(4.8) 


For  the  purpose  of  deriving  the  bound  on  the  inversion  error,  we  will  not  need 
the  exact  values  of  g  [A’] . 

Due  to  the  delay  caused  by  the  inversion,  the  error  cm  is  now  defined  as  (Fig¬ 
ure  4.5(d)): 

eM[k]  =  M[k]  -  M[k  -  1]. 


Proposition  4.1.5  Let  the  Preisach  measure  he  nonnegative ,  and  nonsingular 
with  a  piecewise  continuous  density  /<.  Let  the  Preisach  operator  V  be  Lipschitz 
continuous  with  Lipschitz  constant  LtJ .  Let  Hrn  =  max{|d0|,  | ci 0 1 }  where  [d0,  o.q]  is 
the  input  range  ofT.  Consider  the  inversion  algorithm  obtained  from,  the  explicit 
Euler  method  ( Figure  4.5(a)).  Let  H[—  1]  =  H[ 0]  =  H[ 0].  Pick  g  G  [0,(7],  where 
C  is  the  constant  as  defined  in  (3.44)-  Then  for  any  M  G  loo,  for  any  fit  G  lF, 


cm 


oC  2  L^  z  e  Hm , 


(4.9) 


where 


maxjy, 


le  = 


1  -  niaxl  —  -1,1- 

L  CO 


he i 


coP+C) 


} 


The  optimal  g  to  minimize  je  is 


c 

0+2  • 


Proof  We  first  derive  a  bound  for  e# ,  defined  by  en[A:]  =  H[k]  —  H[k  —  1],  k  >  0. 
Substituting  (4.6)  into  (4.8),  we  have 


eH[k  +  1]  =  a[k}eH[k]  +  b[k](H[k]  -  H{k  -  1]), 


(4.10) 


where 


r,  I  A  ^  hCX 

a\k\  =  1  — 


b\k]  =  9 


co(l  +  g[k])  1  +  g  k 


From  (4.10),  we  compute 


e«(*'+l]  =  (rin[*])eH[0]  +  ^(  n  -  H[i  -  1]). 

i=0  j=i+ 1 


(4.11) 


i=0 
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Figure  4.5:  The  error  in  inversion  of  the  rate-dependent  hysteresis  model. 
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Since  e#  [0]  =  0, 


!'  //'/■'  +  1 


<  2(^a‘)5||ff 

2=0 

Y^~  II  H  IU, 


< 


(412) 


where 


a  =  max  1 1 
se[o,C7] 


hci 

c0(l  +x) 


7  A  ,g-x 

b  =  max  - 

xg[0,<7]  1  +  X 


It’s  easy  to  verify  that 


a  =  max{—  —  1,1 - },  b  = 

co  '  co{l  +  Cy’ 


C  —  g. 

y’  i  +  c- 


Therefore  ||  e#  ||oo<  2ye  ||  H  Hoc.  The  error  e#  can  be  thought  of  as  the  output 
of  some  uncertainty  block  A#  with  the  induced  gain  less  than  or  equal  to  2 % 
(Figure  4.5(b)).  But  since  r, T-1  sit  outside  the  dashed  box  in  Figure  4.5(b), 
we  can  not  carry  A#  along  further.  Instead  we  represent  e#  as  an  exogenous 
disturbance  with  magnitude  bounded  by  2%Hm  (Figure  4.5(c)).  Eq.  (4.9)  now 
follows  using  the  Lipschitz  continuity  and  the  time  invariance  properties  of  F.  It’s 
easy  to  see  that  the  optimal  g  minimizing  the  error  bound  is  A_..  m 

Similarly  we  can  derive  the  error  bound  for  the  implicit  Euler  algorithm  (4.7): 


Proposition  4.1.6  Let  the  assumptions  in  Proposition  4-1-5  hold.  Consider  the 
implicit  Euler  algorithm  (4-5).  Then  for  any  M  £  for  any  To  £  lF. 


Cm  ||ooA  2L^fiHm, 


(4.13) 


where 


=  max{- 


g 


in  -  l  i 

'  llT^  1  +  C  +  ^ 

Co  1  Co 


C  —  g  ^cq(1  +  C)  +  Cih 


Ci  h 


The  optimal  g  to  minimize  7*  is  ■yc^+cih)+Ce  • 
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Remark  4.1.7  For  the  explicit  algorithm ,  the  step  size  h  has  to  be  chosen  small 
enough  to  ensure  stability  of  (4-8)  and  (4. 10).  The  implicit  algorithm,  however,  is 
stable  V/i  >  0.  Therefore  the  implicit  algorithm  is  preferred  in  general. 

Propositions  4.1.5  and  4.1.6  quantify  the  errors  solely  due  to  inversion  algo¬ 
rithms  (4.6)  and  (4.7).  It’s  straightforward  to  extend  the  error  estimates  to  the 
case  that  there  are  parametric  uncertainties  in  Co  and  C\  ,  e.g.,  when  Reddy  in  (3.2) 
is  not  exactly  known.  The  error  due  to  inversion  of  the  Preisach  operator  and 
the  uncertainty  in  the  Preisach  measure  can  also  be  included  as  done  in  Subsec¬ 
tion  4.1.1. 

When  the  square  operator  is  present,  like  in  the  case  of  a  magnetostrictive 
actuator,  the  estimate  of  eu  can  be  derived  from  that  of  6m-  Let  u  G  [umin,umax] 
(recall  Figure  4.1).  One  can  easily  verify  that 

||  ||ooO||  Cjl/  Hoo  T3  ||  <M  1 1 00  Umax ■ 

4.2  Formulation  of  the  Robust  Control  Problem 

We  formulate  the  robust  control  problem  precisely  in  this  section.  For  simplicity  of 
presentation,  we  consider  G0(X)  to  be  the  identity  operator,  i.e.,  we  are  interested  in 
trajectory  tracking  of  the  actuator  head  itself.  We  aim  to  convey  the  essential  ideas 
for  robust  control  of  general  smart  actuators  through  the  example  of  controlling 
the  magnetostrictive  actuator. 

Figure  4.6  shows  the  closed-loop  system  after  the  inverse  compensation  is  done, 
where  the  exogenous  noise  v  represents  the  inversion  error.  From  Section  4.1, 
II  v  ||oo<  v  where  the  bound  v  is  quantifiable  in  terms  of  inverse  schemes  and 
parametric  uncertainties.  Ga  ( A)  stands  for  the  discretized  version  of  the  second 


102 


Figure  4.6:  Robust  control  of  a  magnetostrictive  actuator. 

order  system  (3.3).  The  composition  A  o  Wq(X)  represents  the  deviation  of  the 
actual  plant  from  the  nominal  plant  Ga{ A).  We  assume  that  A  can  be  any  nonlinear 
operator  with  ||  A  || i00-ind<  1.  Ho(A)  is  a  weighting  function  and  it  reflects  that 
at  a  higher  frequency  the  model  uncertainty  is  larger. 

Let  ||  yref  ||oo<  r,  where  y,.ef  is  the  reference  trajectory.  The  error  ey  =  yref  —  y 
is  fed  into  the  controller  A  (A).  The  delay  A  following  K (A)  is  due  to  inversion  of 
the  dynamic  hysteresis  model. 

Let  the  saturation  limits  of  the  actuator  be  —u  and  u  respectively.  Then 
the  saturation  constraint  translates  into  ||  it0  ||oo<  1,  where  uq  is  as  defined  in 
Figure  4.6.  The  case  umin  ^  — umax  will  be  discussed  in  Section  4.4. 

There  are  two  delays  in  the  loop  since  Ga( A)  contains  a  pure  delay.  This 
motivates  us  to  define  the  tracking  error  eg  as 


eoIJfel  =  !,",L'  1 — Ai 


(4.14) 


where  7  >  0  is  the  desired  disturbance  attenuation  level.  To  ease  the  formulation, 
we  normalize  signals  v  and  yref ,  and  regard  v0  and  r0  as  inputs  to  the  system  with 


Figure  4.7:  Formulation  of  the  robust  control  problem. 

vo  ||oo<  1,  ||  r0  ||oo<  1  (Figure  4.6). 

The  transfer  function  G( A)  of  the  open-loop  system  is 


b(A) 


In  terms  of  G,  the  closed-loop  system  in  Figure  4.6  can  be  simplified  as  in  Fig¬ 
ure  4.7(a). 

The  control  objective  is:  find  the  smallest  7  and  a  stabilizing  controller  K( A), 
such  that 

1.  the  closed-loop  system  is  stable  for  any  A  with  ||  A  ||/OCJ-ind<  1, 

2.  ||  eo  1 1 00  A  1  if  A  =  0,  for  all  vq,  ro  with  ||  vq  ||oo<  1  and  ||  ro  ||oo<  1,  and 
3-  ||  Uq  ||oc<  1  if  A  =  0,  for  all  Vq,  r0  with  ||  v0  ||ocA  1  and  ||  r0  ||.oo<  1. 
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If  we  define  the  exogenous  input  w  and  the  regulated  output  z  as 


(  „  ^ 

( r  \ 

A 

Vo 

A 

.  X  = 

60 

v  r°  ) 

V  U°  / 

items  2  and  3  above  are  equivalent  to  the  following: 

II  ||i<  1,  (4.16) 

where  <frzw  is  the  mapping  from  w  to  z. 

By  the  small  gain  theorem  (Appendix  C),  (4.16)  is  equivalent  to  requiring 
robust  stability  of  the  system  when  we  wrap  a  nonlinear  uncertainty  block  A p  from 
z  to  w  with  ||  A p  || ix~ind<  1,  as  shown  in  Figure  4.7  (b).  Therefore  the  control 
problem  can  be  reformulated  as:  find  the  smallest  7  and  a  stabilizing  controller 
K( A),  such  that  the  closed-loop  system  in  Figure  4.7  (b)  is  robustly  stable  for  all 
A  G  A,  where  A  =  {A  =  diag{A,  Ap)  :  A  is  nonlinear  and  of  dimension  1  x  1, 
A p  is  nonlinear  and  of  dimension  2  x  2.  ||  A  |j i00-ind<  1}. 

4.3  Solving  the  Robust  Control  Problem 

To  solve  the  robust  control  problem,  we  need  determine,  for  a  fixed  7  >  0,  whether 
we  can  find  a  stabilizing  K( A),  such  that  the  closed-loop  system  is  stable,  VA  G  A. 
This  will  be  called  the  robust  control  problem,  with  disturbance  attenuation  level  7. 
From  Theorems  C.3.2  and  C.3.4,  the  robust  control  problem  with  attenuation  level 
7  is  solvable  if  and  only  if 

inf  inf  ||  D~l Fi(G,  K)D  |b<  1,  (4.17) 

stabilizing  k  r>'  D 


105 


where 


and  denotes  the  lower  Linear  Fractional  Transformation  (c.f.  Section  C.2). 

We  will  restrict  ourselves  to  finite  dimensional  LTI  (FDLTI)  controllers.  Eq.  (4.17) 
is  a  1 1  model  matching  problem  and  it  can  be  solved  as  discussed  in  Section  C.4. 
First  one  can  use  the  D  —  K  iteration  method  to  decompose  the  joint  optimization 
problem  (4.17)  into  a  sequence  of  decoupled  optimization  problems. 

In  Step  1  of  the  D  —  K  iteration  (Section  C.4),  for  a  fixed  D  £  D,  we  want  to 
solve 

inf  ||  D~1Fi(G,K)D  ||i  .  (4.18) 

stabilizing  k 

Partition  G  into  a  2  x  2  block  matrix  as  shown  in  (4.15)  and  denote  it  as 


Since  G‘2'2  is  stable,  the  set  of  stabilizing  FDLTI  controllers  K  is  parametrized  by 
(see  Corollary  C.2. 5): 

k( A)  = - (-i — .  q  g  nnk\  (4.i9) 

1 -G22Q 

and  the  scaled  achievable  closed-loop  maps  is  parametrized  by 

DlF)(G,  K)D  =  E  -  UQV ,  Q  e  77 H1*1,  (4.20) 

where  E  =  D  lGuD,  U  =  I)  'Cl2  and  V  =  G2lD.  Therefore  (4.17)  is  trans¬ 
formed  into 

inf  ||  E-UQV  Hi  .  (4.21) 

Q&nn^1 
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Problem  (4.21)  is  a  multi-block  l\  model  matching  problem  (Appendix  C).  We  can 
approximate  it  by  a  one-block  l ,  model  matching  problem  through  delay  augmen¬ 
tation  (DA)  (c.f.  Subsection  C.4.3).  The  latter  problem  is  then  solved  using  linear 
programming  (c.f.  Subsection  C.4.2).  Re-ordering  input  and  output  variables  of 
G  if  necessary,  the  lower  bound  77  and  the  upper  bound  r/,v  both  converge  to  the 
minimum  l\  norm  u°  (see  Theorem  C.4.11  for  notation)  as  the  number  of  aug¬ 
mented  delays  N  — ►  00.  We  also  obtain  a  sub-optimal  controller  for  (4.20)  from 
the  DA  method. 

Remark  4.3.1  For  the  system  we  consider ,  the  only  zeros  that  the  delay  aug¬ 
mented  matrices  Un  and  VN  (c.f.  Subsection  C.f. 3)  have  inside  the  unit  disk  are 
0’s.  This  has  two  pleasant  consequences: 

1.  In  computation  of  null  chains  (c.f.  Subsection  C.f. 2)  and  evaluation  of  the 
zero  interpolation  conditions,  relevant  coefficients  can  be  obtained  directly 
from  the  impulse  responses  and  we  thus  avoid  expensive  symbolic  calculation 
of  high  order  derivatives. 

2.  From  the  zero  interpolation  conditions  (C.17)  in  Theorem  C.f.  10,  the  upper 
bound  on  the  ( finite )  length  of  the  impulse  response  $  is  explicitly  known. 

Since  there  are  only  two  blocks  in  the  structured  uncertainty  class  A,  an  ana¬ 
lytical  expresssion  for  the  optimal  D*  exists  in  Step  2  of  the  D  —  K  iteration. 

We  now  present  some  computation  results  on  how  the  optimal  attenuation  level 
7*  is  affected  by  the  following  factors:  the  magnitude  of  uncertainty,  the  magnitude 
v  of  the  inversion  error  and  the  saturation  limit  u. 

The  sampling  frequency  we  use  is  2000  Hz.  The  corresponding  Ga{ A)  is 

-  2.23  x  10~UA2  +  4.28  x  10  1 1 A 

^  )_  0.147A2  —  0.549A  +  1  ' 
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Figure  4.8:  Effect  of  the  model  uncertainty  oil  7*. 


We  choose  the  continuous  time  weighting  function  to  be  H  o(s)  =  ,  where 

cw  >  0  determines  the  magnitude  of  the  uncertainty  in  the  plant.  Discretizing 
ld'o(s)  gives 


W0(  A) 


1.1759c*,  (A  -  1.0005) 
A  -  1.1765 


We  let  r  =  30. 


Figure  4.8  shows  the  effect  of  the  uncertainty  magnitude  on  7*.  Other  param¬ 
eters  used  are  v  =  0.1  Mg,  u  =  7.5 A/’2,  where  Ms  is  the  saturation  magnetization. 
Since  the  range  of  u  for  the  magnetostrictive  actuator  is  [0,  Mg] ,  expressing  v  and 
u  in  terms  of  Mg  allows  one  to  make  more  concrete  sense  out  of  these  numbers. 
From  Figure  4.8,  we  see  that  the  higher  the  uncertainty,  the  bigger  7*. 

Figure  4.9  displays  how  7*  varies  with  the  magnitude  v  of  the  inversion  error, 
where  we  have  fixed  cw  =  6.53  x  10“ 13  and  u  =  1.25 Mg.  As  one  expects,  the 
optimal  attenuation  level  7*  increases  as  v  increases. 

Figure  4.10  shows  how  7*  is  affected  by  the  saturation  constraint.  We  have 
used  cw  =  6.53 x  10“ 13  and  v  =  0.1  Mg.  7*  drops  when u  increases,  but  7*  becomes 
a  constant  when  u  hits  4.5 Mg,  beyond  which  the  saturation  constraint  no  longer 
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Figure  4.9:  Effect  of  the  inversion  error  on  7*. 


plays  a  role. 


4.4  Simulation  and  Experimental  Results 

In  this  section  we  conduct  simulation  and  experiments  to  examine  the  effectiveness 
of  the  robust  controller  design  method. 

The  saturation  constraint  considered  so  far  is  of  the  form  |it|  <  u.  But  for 
real  actuators,  the  saturation  limits  may  be  asymmetric,  i.e.,  umin  7^  —Umax-  For 
example,  for  the  magnetostrictive  actuators,  u  G  [0,  Mf\.  To  handle  the  general 
constraint  u  G  [umin,  umax\,  we  let  u  =  and  nb  =  Hmas±amia.  ^  The  quantity 

u  is  the  saturation  limit  to  be  used  in  the  controller  design,  while  ub  is  a  bias  input 
to  be  injected  into  the  system.  Then  the  actual  control  will  be  u  =  uc  +  ub  with 
\uc\  <  u. 

Since  the  gain  of  Wo  is  close  to  0  for  a  dc  signal,  we  can  ignore  the  contribution 
of  Ui,  to  the  actuator  output  y  through  the  A  o  IT0  branch.  Its  contribution  through 
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Figure  4.10:  Effect  of  the  saturation  limit  oil  7*. 
the  Ga  branch  can  be  calculated  as 

\Ga(\) 

Ub  = - — — — — ub- 

1  +  A  A  ( X)Ga(X) 

The  previous  robust  control  framework  applies  if  we  add  yb  to  the  reference  tra¬ 
jectory  yref  (or  alternatively,  taking  yb  off  from  y).  Figure  4.11  shows  the  flow 
diagram  for  simulation  of  trajectory  tracking. 

As  we  have  seen  from  Figure  4.10.  the  tracking  performance  deteriorates  as  the 
saturation  constraint  u  is  tightened.  For  the  magnetostrictive  actuator,  u  =  0.5  A  A2 
and  strictly  enforcing  this  constraint  will  lead  to  large  tracking  errors.  This  reveals 
the  limitation  of  pure  linear  design  for  an  intrinsically  nonlinear  plant.  Hence  a 
practical  approach  would  be  to  properly  relax  the  constraint. 

Figures  4.12  and  4.13  show  the  simulation  results  of  tracking  two  desired  tra¬ 
jectories:  a  sinusoidal  signal  and  an  irregular  signal  generated  via  a  Van  del  Pol 
oscillator.  In  the  figures,  the  desired  signals  are  intentionally  delayed  by  two  time 
steps  (recall  our  definition  of  tracking  error  (4.14)).  The  current  /  applied  is  also 
displayed.  The  controller  K  (A)  is  designed  based  on  cw  =  3.3  x  10~13,  v  =  O.IAA2, 
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ub 


Controller 


Figure  4.11:  The  flow  diagram  of  the  closed-loop  system, 
and  u  =  3.25 M2. 

Figure  4.14  shows  the  inversion  error  M[k  +  1]  —  M[k]  during  simulation  of 
tracking  the  sinusoidal  signal.  Figure  4.15  shows  the  control  output  uc  of  K( A), 
and  we  see  that  although  we  set  a  =  3.25A/2  in  the  controller  design,  the  output  uc 
stays  in  the  (true)  unsaturated  region  [— 0.5A/2,0.5AT2]  except  during  the  transient 
period  at  the  beginning. 

Our  composite  controller  (the  linear  robust  controller  plus  the  inverse  algo¬ 
rithm)  is  compilation  efficient  and  we  can  implement  it  in  real-time.  Figures  4.16 
and  4.17  show  the  experimental  results  of  trajectory  tracking  based  on  the  same 
controller  as  used  in  the  simulation.  We  can  see  that  the  experimental  results 
match  well  with  the  simulation  ones  and  the  overall  performance  is  satisfactory. 

The  saturation  limit  u  can  not  be  “over-relaxed” .  For  example,  we  design 
another  controller  based  on  f  =  25,  cw  =  3.3  x  10  13,  v  =  0.05A72,  and  u  =  5 M2. 
The  simulation  result  (Figures  4.18)  based  on  this  new  controller  is  better  than 
that  in  Figure  4.12.  But  in  the  experiment  the  tracking  performance  suffers  from 
the  persistant  saturation  (Figure  4.19). 
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Figure  4.12:  Simulation  result  of  tracking  a  sinusoidal  signal. 
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Figure  4.13:  Simulation  result  of  tracking  an  irregular  signal. 
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Figure  4.16:  Experimental  result  of  tracking  a  sinusoidal  signal. 
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Figure  4.17:  Experimental  result  of  tracking  an  irregular  signal. 


Figure  4.18:  Simulation  result  of  trajectory  tracking  based  on  an  “over-relaxed” 
controller. 


115 


Figure  4.19:  Experimental  result  of  trajectory  tracking  based  on  an  “over- relaxed” 
controller. 
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Chapter  5 


Optimal  Control  of  Hysteresis:  A 
Viscosity  Solutions  Approach 


In  this  chapter  we  study  optimal  control  of  hysteresis  in  smart  actuators.  Optimal 
control  of  dynamical  systems  with  various  hysteretic  nonlinear  it  ies  has  been  studied 
in  [18,  2,  3,  4,  11,  12].  Dynamic  programming  is  one  of  the  most  important  tools 
in  the  optimal  control  theory.  All  the  work  mentioned  earlier  except  [18]  took 
the  dynamic  programming  approach,  while  in  [18]  the  author  aimed  to  seek  the 
necessary  conditions  (the  Pontryagin  Maximum  Principle)  for  optimality. 

When  the  value  function  of  the  control  problem  is  smooth,  we  can  derive  the 
Hamilton- Jacobi-Bellman  (HJB)  equation  from  the  Dynamic  Programming  Prin¬ 
ciple  (DPP),  and  in  many  cases,  solving  the  HJB  equation  amounts  to  solving  the 
optimal  control  problem.  The  value  function  however,  in  general,  is  not  smooth 
even  for  smooth  systems,  not  to  mention  for  a  hysteretic  system.  Crandall  and  Li¬ 
ons  [26]  introduced  the  notion  of  viscosity  solutions  to  Hamilton- Jacobi  equations. 
This  turned  out  to  be  a  very  useful  concept  for  optimal  control  since  value  func¬ 
tions  of  many  optimal  control  problems  do  satisfy  the  HJB  equation  in  the  viscosity 
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sense;  and  under  mild  assumptions,  uniqueness  results  for  viscosity  solutions  hold. 
For  references  on  application  of  the  viscosity  solutions  theory  to  optimal  control 
problems,  please  see  [32,  9],  where  [32]  mainly  deals  with  stochastic  optimal  control 
problems  while  [9]  is  devoted  to  deterministic  problems. 

The  viscosity  solutions  approach  was  taken  in  [2,  3,  4,  12].  We  will  explore  this 
approach  for  control  of  hysteresis  in  smart  actuators.  We  will  first  discuss  control 
problems  based  on  a  low  dimensional  hysteresis  model  [78].  The  model  is  a  hybrid 
system  with  both  controlled  switching  and  autonomous  switching.  It  belongs  to 
the  class  of  Duhem  hysteresis  models  and  can  be  rewritten  as  a  system  involving 
both  continuous  control  and  switching  control.  Then  we  will  consider  an  optimal 
control  problem  based  on  the  dynamic  hysteresis  model  proposed  in  Chapter  3. 

Lots  of  work  has  been  done  in  control  of  hybrid  systems.  Witsenhausen  for¬ 
mulated  a  class  of  hybrid-state  continuous-time  dynamical  systems  and  studied 
an  optimal  control  problem  back  in  1966  [89] .  Yong  studied  the  optimal  control 
problem  for  a  system  with  continuous,  switching  and  impulse  controls  in  [91].  The 
Pontryagin  Maximum  Principle  or  its  variant  was  used  in  optimal  control  for  hy¬ 
brid  systems  in  [63,  68].  By  solving  the  Bellman  inequality,  a  lower  bound  on  the 
value  function  and  an  approximation  to  the  optimal  control  law  were  obtained  in 
[42,  67].  With  a  unified  model  for  hybrid  control,  Branicky,  Borkar  and  Mitter 
proposed  generalized  quasi-variational  inequalities  (GQVIs)  satisfied  by  the  value 
function  [17]. 

This  chapter  will  be  organized  as  follows.  First  we  introduce  the  low  dimen¬ 
sional  hysteresis  model  in  Section  5.1.  Based  on  this  model  we  then  formulate 
and  solve  an  infinite  time  horizon  control  problem  using  the  viscosity  solutions 
approach  in  Section  5.2.  The  approach  is  extended  to  other  control  problems  of 


118 


practical  interest  in  Section  5.3.  Finally  we  discuss  an  optimal  control  problem 
based  on  the  dynamic  hysteresis  model  (3.2)  in  Section  5.4. 

5.1  The  Low  Dimensional  Ferromagnetic  Hys¬ 
teresis  Model 

As  we  introduced  in  Chapter  2,  when  the  input  frequency  is  low,  the  magnetostric- 
tive  hysteresis  is  rate- independent.  Furthermore,  we  can  relate  the  magnetostric¬ 
tion  to  the  magnetization  M  by  a  square  law  and  relate  the  input  current  /  to  the 
magnetic  field  H  by  a  proportional  law.  Hence  in  this  case  the  magnetostrictive 
hysteresis  is  fully  captured  by  the  ferromagnetic  hysteresis  between  M  and  H. 

•files  and  Atherton  proposed  a  low  dimensional  model  for  ferromagnetic  hystere¬ 
sis,  based  upon  the  quantification  of  energy  losses  due  to  domain  wall  intersections 
with  inclusions  or  pinning  sites  within  the  material  [51].  A  modification  to  the 
•files- Atherton  model  was  made  by  Venkataraman  and  Krishnaprasad  with  rigor¬ 
ous  use  of  the  energy  balancing  principle  [84].  The  resulting  model,  named  the 
bulk  ferromagnetic  hysteresis  model  has  a  slightly  different  form  from  the  .Jiles- 
Atherton  model.  Also  based  on  the  energy  balancing  principle,  they  derived  a  bulk 
magnetostrictive  hysteresis  model  [83] ,  where  high  frequency  effects  are  considered. 

Here  we  will  restrict  ourselves  to  the  low  frequency  case  to  highlight  the  method¬ 
ology  of  hysteresis  control.  Extension  to  the  high  frequency  case  is  straightforward. 
We  now  briefly  outline  the  bulk  ferromagnetic  hysteresis  model. 

For  an  external  magnetic  field  H  and  a  bulk  magnetization  M,  we  define 

IIt,  -  II  4  aM 

to  be  the  effective  field,  where  a  is  a  mean  field  parameter  representing  inter- 
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domain  coupling.  Through  thermodynamic  considerations,  the  anhysteretic  mag¬ 
netization  Man  can  be  expressed  as 

Man(Hc)  =  Ms(coth(— )  --£-)  =  MaC(z),  (5.1) 

a  H  c 

where  £(•)  is  the  Langevin  function,  C(z)  =  cotli(z)  —  7,  with  z  —  yy,  Ms  is  the 
saturation  magnetization  of  the  material  and  a  is  a  parameter  characterizing  the 
shape  of  the  Man  curve. 

Define 


/■(//.  M)  = 

a  acMs  d{z] 

/2(HM)  = 

-  /toa(Man  (  Hc)  -  M) 

k(a  -  acMj^f)  !  //0aa(Man(He)  -  M) 

/.(//• -W)  = 

r/.\ 1  k~~~  i  //„a:M:, „(//,.)  -M) 

A-(n  -  acM/2^1)  _  //oaa(Man(He)  -  Ad) 

where  c  is  the  reversibility  constant,  Ho  is  the  penneability  of  vacuum,  k  is  a 
measure  for  the  average  energy  required  to  break  a  pinning  site.  Note  each  fi  is 
smooth  in  11  and  M. 

The  bulk  ferromagnetic  hysteresis  model  is  as  follows  [84]: 

1,  dH  <  0,  M  <  Man(//,  )  or 
dll  >  0.  M  >  Man ( Hf, ) 

2,  dH  <  0,  M  >  Man(Hc) 

3,  dll  >  0,  M  <  Man(Hc) 
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If  we  define  a  control  u  =  //.  the  model  is  rewritten  as 


V 


M 


u,  where  i  —  < 


1,  u  <  0,  M  <  Man(He )  or 
u>  0,  M  >  Man{Hc) 

2,  «  <  0,  M  >  Man(He) 


[  3,  u  >  0,  M  <  Man(Hc) 

(5.3) 


Remark  5.1.1  /Vote  that  the  control  u  defined  above  is  different  from  the  physical 
current  I  toe  apply  to  the  actuator.  The  current  I  is  related  to  the  state  component 
H  by  a  constant  cq  ( the  coil  factor):  H  —  c0I.  Therefore  from  the  control  u,  the 
current  we  will  apply  is  I(t )  =  1(0)  +  ^  f*  u(s)ds. 

Remark  5.1.2  The  switching  in  (5.3)  depends  on  both  (the  sign  of)  the  contin¬ 
uous  control  u  and  the  state  ( H,M ).  therefore  the  model  (5.3)  is  a  hybrid  system 
with  both  controlled  switching  and  autonomous  switching  [16,  17]. 


We  can  represent  (5.3)  in  a  more  compact  way.  Letting 


Sh={(H,  M)  :  M  <  Maa(He)},  M)  :  M  >  Man(Rc)}, 


and  x  =  (H,  M),  we  define 

/ 


/+(*)  = 


l  V 


1 

h(x) 

1 

Mx) 


if  x  €  Oo 


\ix  G 


<'  i  \ 


,  and  /_  (x)  =  < 


h(x) 

1 


if  x  £  O  i 


if  X  £  f  1‘2 


(  \MX) 

Since  fi,  1  <  i  <  3,  coincide  on  T  =  {( H,M )  :  M  =  Man ( Hc) } ,  f+  and  /_  are 
continuous.  We  define  two  continuous  control  sets 


A 


U+  =  { u  :  uc  >  u  >0},  IT  =  {?/  :  —uc  <  u  <  0}, 
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where  uc  >  0  represents  the  operating  bandwidth  constraint  of  the  actuator  (recall 
u  =  Col).  To  ease  the  presentation,  we  make  the  dependence  of  switching  on  u 
explicit  by  introducing  a  discrete  control  set  D  =  {1,2}. 

Now  the  model  (5.3)  can  be  represented  as  a  system  with  both  a  continuous 
control  u  and  a  discrete  mode  (switching)  control  d: 


a  f+(x)u,  u  G  U+.  if  d  =  1 
x  =  f(x,u,d )  =  {  '  .  (5.4) 

I  f-(x)u,  u  G  U-,  if  d  =  2 

The  (state-dependent)  autonomous  switching  has  now  been  incorporated  into 
the  definitions  of  /+ ,  /_,  thanks  to  the  nice  structure  of  the  physical  model.  Note 
the  model  (5.4)  belongs  to  the  category  of  Duhem  hysteresis  model  [86]. 


5.1.1  Properties  of  the  model 

We  first  present  a  lemma  which  will  be  useful  in  the  proof  of  Proposition  5.1.4. 


Lemma  5.1.3  [77]  C(z)  satisfies: 


Proof 


Therefore 


c)C(z)  1 
dz  ~  3’ 
\C(z)\  <  1. 


dc(z) 

dz 


—  -  csch2(z ) 

1  1 

^  — r 

1  1 

^~(^  +  #  +  ¥ +  *")2‘ 


dc(z) 

dz 


>  0 0. 


(5.5) 

(5.6) 


(5.7) 

(5.8) 
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Further  manipulation  on  (5.7)  yields 


dC{z)  _  (2  +  |f  +  yr  j  ) ( 3l  +  If  ^  ) 

^  "  <1  +  #  +  !f  +  -">2 


from  which  we  obtain 


dC ,  ,  1 

&(0)  =  r 


Combining  (5.8)  and  (5.10)  we  have 


dC(z) 


(5.10) 


(5.11) 


Since  in  addition, 


lim  C( z)  =  1,  lim  C( z)  =  —  1, 


we  have  (5.6). 

To  prove  <  tt.  ii  suffices  to  show 


&><ov,>a 


d2C(z)  _  8{ez  +  e-z)  2 

i)z2  (ez  -  e_z)3  23 


=  2 


hr  +  yH  +  T  +  ¥  + 

Q  /  1  ,  *2  ,  y-X.  \  Q 


£1  +  _  J _ )3 

3!  ^  5!  '  / 


so  we  need  only  to  show  that  the  numerator  of  (5.13)  is  always  less  than  0,  Vz  ^  0. 
We  first  note  that  the  coefficient  of  z2k  ,k>  1  in  the  second  term  is 


3^(2fc  +  l)!  +  (2k-  1)!3!  +"^  >  3  (2k  —  1)!3! 


(2k)  V  3! 

1 
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while  jr^y  is  the  coefficient  of  z2k  in  the  first  term.  For  k  =  0, 1,  the  coefficients  of 
both  terms  cancel  out.  The  proof  is  now  complete.  ■ 


Proposition  5.1.4  (Boundedness  of  /,  )  If  the  parameters  satisfy: 

A  acMs 

1 1  =  a - >  0, 

3 

T‘2  =  k(a - — - — )  —  2 (i^aaMs  >  0, 

O 

then  0  <  fi  <  Cf  ,  i  =  1,2, 3,  for  some  constant  Cf  >  0. 


(5.14) 

(5.15) 


Proof  By  (5.5)  and  (5.14) 


m  acMs  , ,  OCiz) 

0  <  T\  =  a - - —  <  a  —  acMs — - —  <  a. 

3  oz 


We  rewrite  f  \  as 


f  1  i  n 

a  a{a-ac:Ms?%±) 

and  note  that  it  is  a  nondecreasing  function  of  .  Since 


fi  =  0  when  =  0, 


h  = 


cM3 


3  a  —  acAL 


=:  Ci  when  ^ 


we  get 


0  <  fi  <  C\. 


The  function  f  2  can  be  written  as 

,.  1  ka 

J  2  —  — 


1 


a  a  ,  ,  . ,  dC(z ) 

k(a  —  acMs - )  + 

oz 


=:T 


Alan  ~~ 


From  the  model  (5.3),  when  f2  is  selected,  Man  —  M  <  0.  Since  magnitudes  of 
both  Man  and  M  must  be  less  than  Ms,  Man  —  M  >  —2 Ms.  These  facts  together 
with  (5.5)  yield 

0  <  T2  <  T  <  ka. 
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Therefore 


o  <  h  <  =■  C2. 

Oil  2 

Similarly  we  can  show7  0  <  fs  <  C2.  Picking  Cf  =  max{Ci ,  Co},  we  have  0  <  fi  < 
Cf  for  *  =  1,2,3.  ■ 


Remark  5.1.5  Conditions  (5.14)  and  (5.15)  are  satisfied  for  typical  parameters. 
For  example,  taking  the  parameters  identified  in  [82],  a  —  1.9  x  10-4,  a  =  190, 
k  =  48  Gauss,  c  =  0.3,  Ms  —  9.89  x  103  Gauss  and  no  —  1,  we  calculate  7\  =  189.8  , 
T2  =  8.40  x  103. 


Proposition  5.1.6  (Lipschitz  continuity)  Functions  f+(x)  and  f-(x)  are  Lip- 
schitz  continuous  with  some  Lipschitz  constant  L,  and  f(x,  u,d )  is  Lipschitz  con¬ 
tinuous  with  respect  to  x  with  Lipschitz  constant  L0  =  Luc . 

Proof  We  first  prove  /_  is  Lipschitz  continuous  with  some  Lipschitz  constant 
L_.  We  discuss  three  cases: 

•  Case  I:  Both  x\,x2  €  Qj.  In  this  case,  mode  1  is  active,  and  thus 

df-(x)  _  I  0  0 

Ox  dfj(H.M)  ilfy.  II, Mi 

y  OH  dM 

It  can  be  shown  that  I  Ul  |  <  C\  ,  \  \  <  ocC\  for  some  C\  >  0. 

Therefore  |  d^f) )  |  <  L\  for  some  Li  >  0,  and  the  following  holds: 

|/-Oi)  -  f-(x 2)|  <  Li \xi  -  X2\. 


•  Case  II:  Both  x\,x2  €  kl2.  In  this  case,  mode  2  is  active.  Following  similar 
steps  as  in  Case  I,  we  can  show  | <  C2  for  some  L2  >  0  and  therefore 


l/-(*i)  -  /-(® 2)|  <  L2 \xi  -  x2 
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•  Case  III:  X\  E  0 1 ,  x2  E  £22.  Then  there  exists  x0  E  T,  such  that  the  line 
segment  connecting  x\  and  x->  intersects  T  at  x0.  We  express  Xq  =  9x\  +  (1  — 
9)x2  with  0  <  6  <  1.  Thus 

|/-(a?i)  -  /-(*2)|  =  \f-(xi)  -  f~(x0)  +  f-(x0)  -  f-{x2)\ 

<  Li\xi  -  x0\  +  L2\xq  -  x-2 1 

=  Li(  1  -  9) \xi  -  x2\  +  L29\xi  -  x2\ 

<  L_ \xi-x2\, 

with  L _  =  ma.x{Li,  L2}. 

Following  exactly  the  same  arguments,  we  can  show,  there  exists  L+  >  0,  such 
that 

I f+(xi)  -  f+(x2) |  <  L+\xi  -x2\,  Vxux2. 

We  conclude  the  first,  part  by  taking  L  =  max{L_ ,  L+}.  The  rest  of  the  proposition 
follows  trivially.  ■ 

5.2  The  Infinite  Time  Horizon  Optimal  Control 
Problem 

We  first  formulate  an  infinite  time  horizon  optimal  control  problem  for  the  system 
(5.4).  Define  the  cost  functional  with  an  initial  condition  x  and  a  control  pair 
<*(•)  =  M-)w(-)}  as 

POO 

J(x,  «(•))=  /  l(x(t),u(t))e~Xtdt,  (5-17) 

Jo 

where  the  discount  factor  A  >  0.  Note  the  running  cost  /(•,•)  is  defined  to  be 
independent  of  the  switching  control  d,  since  this  makes  sense  in  the  context  of 
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smart  actuator  control.  We  require  u(-)  to  be  measurable.  This  together  with 
Proposition  5.1.6  guarantees  that  (5.4)  has  a  unique  solution  x(-)  (the  dependence 
of  x(-)  on  x  and  q(-)  is  suppressed  when  no  confusion  arises). 

The  optimal  control  problem  is  to  find  the  value  function 

V (x)  =  inf  J(x ,  «(•)), 

a(-) 

and  if  V(x)  is  achievable,  find  the  optimal  control  «*(•). 

We  make  the  following  assumptions  about  /(•,  •): 

•  {Ax):  l(x,u)  continuous  with  respect  to  x  and  u,  l(x,u )  >  0,  \/x,u; 

•  (A2):  1(0,0 )  =  0,  | l(xi,u)  -  l(xo,u)\  <  Ci(  1  +  |aq|  +  |a;2|)|.x'i  -  x2\,  Vu,  for 
some  Ci  >  0. 

Note  (A2)  includes  the  case  of  quadratic  cost. 

5.2.1  Properties  of  the  value  function 

We  can  show  the  value  function  is  locally  bounded  and  locally  Lipschitz  continuous. 

Proposition  5.2.1  (Local  boundedness)  Under  assumptions  (A:1)  and  (A2), 
VA  >  0,  V(x)  is  locally  bounded ,  i.e.,  Vi?.  >  0,  3  Cr  >  0,  such  that 

|P(.r)|  <  CR,  Vx  E  B{ 0,  R)  =  {x  :  |ar|  <  R}. 

Proof  First  note  that  since  /(•,•)  is  nonnegative,  V(x)  >  0,  Mx.  Take  u(t)  =  0, 
then  x(t)  =  x.  Letting  a(t)  =  {d(t),u(t)}  where  d(t)  =  1,  we  have 

f'OG 

V(x)  <  J(x,  «(•))  =  /  l(x,0)e^Mdt 

Jo 

l(x,  0) 

A  ‘ 
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By  (A2),  l(x,  0)  <  Ci(  1  +  /?)/?,  and  the  proof  is  complete  with  Cr  =  C'd+R)R^  ■ 
To  prove  the  local  Lipschitz  continuity  of  the  value  function,  we  will  need  the 
following  lemma: 

Lemma  5.2.2  Let  x\  (•),  x2(-)  be  solutions  to  (5-4)  under  some  admissible  control 
a(-)  =  {d(-),u(-)}  with  initial  condition  x i,x2  respectively.  Then 

\x\  (i)  —  x2{t) |  <  eLot\xi  —  x2\ ,  (5.18) 

|oq(i)|  <  |ou|eiot  +  ^—(eLot  —  1),  (5.19) 

To 

where  C  =  max(i  \f(0,uc,d)\,  and  L0  is  as  defined  in  Proposition  5.1.6. 

Proof  1.  We  first  show  (5.18).  Denote  the  sequence  of  mode  switching  times  as 
{U,  i  =  0, 1,  •  •  •  }  with  t0  =  0,  and  the  mode  during  [U,  ti+i)  as  d*.  Then  Vi  G  [0,  t, :L), 

-  X2{t) |2  =  2{xi (t)  -  x2{t))  ■  {f{xy {t),u(t),  do)  -  f(x2(t),u(t),da) 

<  2L0\xl{t)  -  x2{t) |2, 

where  the  inequality  conies  from  Proposition  5.1.6.  Integrating  both  sides  from  0 
to  t  and  applying  the  Gronwall  inequality,  we  get 

|.r,  (/ )  -  x2(t) |2  <  \x\  -  x2\2e2Lot, 

from  which  (5.18)  follows.  Now  Vi  G  [fi,t2),  taking  Xi(t\) , x2(t\)  as  initial  condi¬ 
tions,  we  follow  the  above  procedures  and  get 

\xi(t)  —  x2(t)\  <  Mh)  - 

<  \x-i  -  x2|eLo(t“tl)eLo(tl"0)  =  I*!  -  x2\eLot. 

Using  the  same  argument  successively,  we  can  show  that  (5.18)  holds  Vi  >  0. 
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2.  Now  we  show  (5.19).  Vxi(t)  0,  we  can  write 


\xi(t)\^\xi(t)\  =  ^kiWI2  =  ®i(*)  •  f(xi(t),u(t),d(t)) 

=  xi(t)  ■  (f(0,u(t),d(t))  +  f(xi(t),  u{t),d(t))  -  f(0,u(t),d(t))) 
<  C\xi(t)\  +  L0\xi(t)\'\ 

from  which  we  obtain 

^Mt)|  <  C  +  L0\xi{t)\. 

Integrating  it  from  0  to  t  and  then  using  the  Gronwall  inequality,  we  have  (5.19). 


Proposition  5.2.3  (Local  Lipschitz  continuity)  Under  assumptions  (A,)  and 
(A2),  VA  >  2 L0  with  Lq  as  defined  in  Proposition  5.1.6,  V(x)  is  locally  Lipschitz, 
i.e.,  Vi?  >  0, 3Lr  >  0,  such  that  |U(;ri)  —  V(x2)\  <  Lr\xi~ x2\,  Vxi,x2  G  B(0,R). 
In  addition,  LR  can  be  chosen  to  be  C(l  +  R)  for  some  C  >  0. 

Proof  For  e  >  0,  let  ae(-)  =  {de(-),  ue(-)}  be  e— optimal  for  x2 ,  i.e.. 

V(x2)>J(x2,aef))-e. 

Since  V{x\)  <  J[.r\ .  <\'  [■))■  we  have 

V{xf)-V{x2)  <  J(xuae(-))-  J(x2,ae(-)) 

poo 

<  /  e-^\l(x1(t),ue(t))-l(x2(t),uemdt  +  ^ 

Jo 

<  [  e~xtQ(  1  +  |a?i(t)|  +  Ix^BjDlxx (t)  -  x2(t)\dt  +  e, 

Jo 

where  the  last  inequality  is  from  (Af).  Using  Lemma  5.2.2,  we  get 

V(x{)  -  V(x2)  <  Lr\xi  -  x2\  +  e, 
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where  LR  =  (C0  +  \^Lo)Ci  and  C0  is  a  constant  independent  of  R.  Since  e  is 
arbitrary,  we  have 

V(xi)  -  V(x2)  <  Lr\xi  -  x2\. 

The  proof  is  complete  by  noting  that  X\  and  x2  are  symmetric.  ■ 

Remark  5.2.4  One  can  get  a  sharper  estimate  for  |./-|  (/ )  (linear  growth )  by  ex¬ 
ploiting  Proposition  5.1.4-  This  can  be  used  to  weaken  the  condition  A  >  2 L0  to 
A  >  Lq  in  Proposition  5.2.3  and  anywhere  else  it  appears. 

5.2.2  The  Dynamic  Programming  Principle  and  the  Hamilton- 
Jacobi-Bellman  equation 

The  value  function  satisfies  the  Dynamic  Programming  Principle  (DPP): 

Proposition  5.2.5  (DPP)  Assam, e  (Ti  )  and  (A2),  A  >  0.  We  have 

V(x)  =  inf {  [  e~Xsl(x(s),  u{s))ds  +  e~xtV(x(t))},  Vi  >  0,  \/x.  (5.20) 

The  proof  is  omitted  since  the  argument  is  standard,  see,  e.g.,  [9]. 

Based  on  the  DPP,  we  can  show  that  the  value  function  V (■)  satisfies  a  Hamilton- 
Jacobi-Bellman  equation  (HJB)  of  a  hybrid  type  in  the  viscosity  sense.  Viscosity 
solutions  to  Hamilton-Jacobi  equations  were  first  introduced  by  Crandall  and  Lions 
[26].  Here  we  use  one  of  the  three  equivalent  definitions  [25]: 

Definition  5.2.6  (Viscosity  solutions)  [9]  Let  W  be  a  continuous  function  from 
an  open  set  O  C  Mn  into  R  and  let  DW  denote  the  gradient  of  W  ( when  W  is 
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differentiable ).  We  call  W  a  viscosity  solution  to  a  nonlinear  first  order  partial 
differential  equation 

F{x,  W(x),  DW(x ))  =  0,  x  G  O,  (5.21) 

where  F  :  O  x  R  x  Rn  — >  R  is  continuous ,  if  W  is  both  a  viscosity  subsolution 
and  viscosity  supersolution;  and  by  viscosity  subsolution  (supersolution,  resp.),  we 
mean:  \/<f>  G  Cl(0),  ifW—6  attains  a  local  maximum  (minimum,  resp.)  at  Xq  G  O, 
then  F(xq,W(xq),D4>(xq))  <  0  (>  0,  resp.). 

Viscosity  solutions  have  a  couple  of  nice  properties  [25,  26].  We  mention  one 
elementary  property  here,  consistency  with  the  notion  of  classical  solution,  that  is: 
1)  any  classical  solution  to  (5.21)  is  a  viscosity  solution;  2)  the  viscosity  solution 
satisfies  (5.21)  in  the  classical  sense  at  any  point  where  it  is  differentiable. 

Theorem  5.2.7  (HJB)  Assume  (.4,)  and  (A>),  A  >  2 L0.  V(x)  is  a  viscosity 
solution  of: 

A W(x)  +  max{  max{- u  f+(x)  ■  DW(x)  —  l(x,  n)}, 

U&J+ 

max{  —  uf-(x)  ■  DW(x )  —  l(x,u)}}  =  0,  x  G  R2.  (5.22) 

u&U- 

Proof  1.  We  first  show  V{-)  is  a  viscosity  subsolution.  For  any  u  G  U-,  take 
q(-)  =  (d(-),  it(-)}  with  d(t)  =  2,  u(t)  =  u.  From  (5.20),  for  any  t  >  0, 

V(x )  <  /  l(x(s),u)e~Xsds  +  e~xtV(x(t)), 

Jo 

which  we  rewrite  as 

V(x{t))  -  V(x)  +  /  l(x(s),u)e~Xsds  +  V(x(t))(e~xt  -  1)  >  0.  (5.23) 

Jo 
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Now  suppose  that  V  —  6  with  6  £  Cl  (R2)  has  a  local  maximum  at  x,  then 


V(x(t ))  —  6{x(t))  <  V{x)  —  4>{x), 


for  t  sufficiently  small.  This  together  with  (5.23)  implies 


4>(x(t))  —  4>(x)  +  /  l(x(s),u)e  Asds  +  V(x(t))(e  —  1)  >  0.  (5.24) 


Divide  (5.24)  by  t  and  let  t  — »■  0,  we  obtain 


uf-(x )  •  D6(x)  +  l(x,u)  —  XV (x)  >  0,  Vu  E  U-, 


i.e., 


XV (x)  +  max{— uf-{x)  ■  D6(x)  —  l(x,u)}  <  0. 

ue(/_ 


Similarly,  we  have 


XV(x)  +  max{— u  f+(x)  ■  Dd>(x )  —  l(x,u)}  <  0. 

ueu+ 


Therefore 


XV (x)  +  max{ max { — u/_ (x)  ■  Dd)(x )  —  l(x,u)}, 

u&U- 

max{— uf+(x)  ■  Dd)(x )  —  l(x,u)}}  <  0.  (5.25) 

uef/+ 

2.  The  proof  of  supersolution  is  much  more  technically  involved  and  therefore 
omitted  here.  It  can  be  found  in  [77].  ■ 

5.2.3  Uniqueness  of  the  solution  to  the  HJB  equation 

We  would  like  to  characterize  the  value  function  V  as  a  unique  solution  to  the 
HJB  equation.  The  uniqueness  result  basically  follows  from  Theorem  1.5  in  [47]. 
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In  [47],  the  author  gave  only  a  sketch  of  proof.  For  completeness,  we  will  provide 
the  full  proof  here. 

Before  stating  the  theorem,  we  first  identify  structural  properties  of  the  HJB 
equation.  We  rewrite  (5.22)  as: 

A W(x)  +  H(x ,  DW{x ))  =  0,  x  e  K'2,  (5.26) 

where 

H(x,p)  =  max{max{-«/+(a:)  •  p  —  l(x,  u)},  max{—  uf-(x)  ■  p  —  l(x,  u)}}, 

m  I : .  n-  I  ' 

is  called  the  Hamiltonian  of  (5.26). 

Proposition  5.2.8  Assume  (A2).  H(x,p )  satisfies  the  following: 

\H(xi,p )  -  H(x2,p)\  <  Cr(1  +  |p|)|.n  -  Xo | ,  Vxi,x2  e  B{ 0,  R),  Vp,  (5.27) 
\H(x,pi)  -  H(x,p2)\  <  Co \pi  —  p2 1 ,  Va\  Vpi , p2,  (5.28) 

for  some  Cr  >  0,  C0  >  0,  with  C r  dependent  on  R. 

Proof  We  will  only  prove  (5.27),  since  proof  of  (5.28)  is  analogous. 

Without  loss  of  generality,  suppose  U\  G  U-  attains  the  maximum  in  H(xi,p). 
Since  H{x2,p )  >  - ulf-{x2 )  •  p  -  l(x2,ui), 

H(xup)  -  H(x2,p)  <  -uif-(xi)  ■  p  -  l{xi,uf)  +  uif-(x2)  ■  p  +  l(x 2,  «i) 

<  \p\L0\xi  -  x2\  +  Ci(  1  +  \xi |  +  LrhjLri  -  ,x2| 

<  CR(1  +  \p\)\xi  -  x2\, 

where  Gr  is  a  constant  dependent  on  R.  By  symmetry,  we  conclude.  ■ 

Remark  5.2.9  As  we  have  seen  above ,  despite  the  hybrid  structure  of  our  physical 
model ,  H(x,p)  enjoys  nice  structural  properties,  which  enables  us  to  prove  the 
uniqueness  result. 
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From  Proposition  5.2.3,  we  know  that  the  value  function  P(-)  belongs  to  the 

class 

V(R2)  =  {W(-)  :  | W(xi)  -  W(xo)\  <  C{  1  +  R)\xx  -  x2\yx1,x2  G  B( 0,  R), 

V  R  >  0,  for  some  C  >  0}. 

The  following  theorem  is  adapted  from  Theorem  1.5  in  [47]. 

Theorem  5.2.10  If  (5.26)  has  a  viscosity  solution  in  V(R2),  it  is  unique. 

Proof  Without  loss  of  generality,  we  take  A  =  1.  Let  W(-),V(-)  G'P(M2)  be 
viscosity  solutions  to  (5.26).  For  e  >  0,a  >  0 ,m  >  2,  define 

$(x,y)  =  W(x)  -  V(y)  -  ^  ~  V -  a(<  x  >m  +  <y  >rn), 

f 

with  <  x  >=  \J  1  +  |.x’|2.  Since  W7(-),  P(-)  GP(R2),  lini|a.|+|3/|_>00  T(.x',  y)=  —  oo.  By 
continuity  of  $(•,•),  there  exists  (;r0,|/o)  where  T  attains  the  global  maximum. 
First  we  want  to  obtain  bounds  for  |;r0|.  \ijo\  and  |a.-o  —  yo\- 

From  $(0,0)  <  $(xo,yo)>  and  W(-),V(-)  eV{K2),  we  can  get 

<  x0  >m  +  <  y0  >m<  Ca(  1+  <  xq  >2  +  <  |/o  >2), 

where  Ca  is  a  constant  independent  of  e  (but  dependent  on  a).  Since  m  >  2,  there 
exists  Ra  >  0  (independent  of  e),  such  that  |a;0|  <  Ra ,  |y0|  <  i?.Q . 

From  $(.To,.To)  +  T(|/o, |/o)  <  2  $(a;0,|/o),  we  can  derive 

\x0  -  yo\  <  eC'a,  (5.29) 

with  6"  depending  on  a  only. 

Define 

6{x)  =  V (y0)  +  -|x  -  i/o | 2  +  a(<  rr  >m  +  <  y0  >m ), 
e 

D(y)  =  W(x 0)  -  -|.r0  -  yf  -  a(<  a;0  >m  +  <  y  >m). 
e 
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Since  W  —  <p  achieves  maximum  at  Xq,  and  V  —  ip  achieves  minimum  at  y0, 


(5.30) 

(5.31) 


W(x o)  +  H{x o,  D6(x0))  <  0, 

V(yo)  +  H(y0,Dtjj(yo))  >  0. 

Subtracting  (5.31)  from  (5.30)  and  using  Proposition  5.2.8,  we  have 

W(xo)  -  I r(y0)  <  CRa{l  +  -\xo  -  yo\)\xo  -  y0 \ 

e 

+aCQm(<  xq  >m_1  +  <  yo  >m_i). 

Now  fix  a,  construct  a  sequence  {e*,}  with  limfe_00  =  0.  We  denote  the 
corresponding  maximizers  of  4»  as  (.x0/,:,  y0k)-  Since  VA*,  (xok,  yok)  €  5(0,  Ra),  by 
extracting  a  subsequence  if  necessary,  we  get 

lim  (xok,  ijok)  -»•  (xa ,  ija)  e  5(0,  Ra).  (5.32) 

k— *oo 

From  (5.29),  we  have  xa  =  ya-  For  each  ffc,  from  $(;r, x)  <  $(xok,yok),  we  can  get 

W  (x)  -  V(x)  -  2 a  <  x  > m  <  CRn  (1  +  — | xok  -  yok  I )  I  xok  -  yok  \ 

+  aC0m{<  xok  >m~'  +  <  yok  >m_l)  -  a(<  xok  >m  +  <  yok  >m), 

and  letting  k  oo, 

W(x)  -  V(x )  <  2a(C'0m  <  xQ  >m~l  -  <  xQ  >m)  +  2o  <  x  >m  . 

Since  Corn  <  ,xQ.  >m_1  —  <  xa  >m<  C"  for  some  C"  >  0, 


W(x)  -  V(x)  <  2 a(C"+  <  x  >m). 

Letting  a  — >■  0,  we  get  W(x)  —  V(x)  <  0,  Var.  We  conclude  by  noting  W  and  V 
are  symmetric.  ■ 

From  Theorem  5.2.10,  if  we  can  solve  for  a  solution  to  (5.26)  in  'P(R2),  it  must 
be  the  value  function.  One  way  to  solve  it  is  by  the  discrete-time  approximation. 
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5.2.4  The  discrete  approximation  scheme 

The  approximation  will  be  accomplished  in  two  steps.  First  we  approximate  the 
continuous  time  optimal  control  problem  by  a  discrete  time  problem,  derive  the 
hybrid  discrete  Bellman  equation  (DBE),  and  show  the  value  function  of  the  dis¬ 
crete  time  problem  converges  to  that  of  the  continuous  time  problem  locally  uni¬ 
formly.  Following  [9],  we  call  this  step  "semi-discrete”  approximation.  Then  we 
indicate  how  to  further  discretize  (DBE)  in  the  spatial  variable,  which  is  called 
“fully-discrete”  approximation.  The  approaches  we  take  here  follow  closely  those 
in  [9] (Chapter  VI  and  Appendix  A). 

Consider  a  discrete  time  problem  obtained  by  discretizing  the  original  contin¬ 
uous  time  one  with  time  step  h  €  (0,  \  ).  The  dynamics  is  given  by 

x[n\  =  x[n  —  1]  +  hf(x[n  —  1],  u[n  —  1],  d[n  —  1]),  :c[0]  =  x,  (5.33) 

and  the  cost  is  given  by 

OO 

Jh(x,a[-])  =  ^  hl(x[n],u[n])(  1  —  A h)n,  (5.34) 

n=0 

where  q[-]  =  {d[-],  u[-]}  is  the  control.  The  value  function  is  defined  to  be 

Vh{x)  =  inf  .Jh(x,  «[•]).  (5.35) 

«[•] 

It’s  not  hard  to  show: 

Proposition  5.2.11  Assume  A\  and  A2,  A  >  2 L0.  Then  I4(-)  G  V(R2),  and  the 
coefficient  C  in  defining  'P(R2)  can  be  made  independent  of  h. 

Following  standard  arguments,  one  can  show: 
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Proposition  5.2.12  (DBE)  Vh(-)  satisfies: 


Vh{x)  =  min{  min  {(1  —  \h) Vh (x  +  huf+(x ))  +  hl(x,  it)}, 
ueu+ 

min  {(1  —  A h)Vh(x  +  huf(x ))  +  hl(x, «)}},  x  £  Mi2.  (5.36) 

7 


It’s  of  interest  to  know  whether  (5.36)  characterizes  the  value  function  Vh{x). 
Unlike  in  [9]  (Chapter  VI),  where  a  bounded  value  function  was  considered,  we 
have  Vh  unbounded.  But  it  turns  out  that  with  a  little  bit  additional  assumption, 
(5.36)  has  a  unique  solution. 


Proposition  5.2.13 


There  exists  a  unique  solution  in  V(R'2)  to  (5.36),  if 

( 1  ~  A/Q (  y/Cp  +  4  +  Cp)  ^  1 
C^Tm-Co 


(5.37) 


where  C0  =  huc 


1 

Cf 


and  C\  is  as  defined  in  Proposition  5.1.4- 


Proof  Let  Vfix)  =  V/fix)  <  x  >  m,rn  >  2,  where  <  x  >=  fi'l  +  \x\2.  Since 
Vh  G  Vh  is  bounded.  In  terms  of  Vh,  (5.36)  is  rewritten  as 

Vh(x)  =  {G(Vh))(x)  =  mm{  (5.38) 

min { ( 1  -  Xh)Vh{x  +  huf+(x))  <  +  h"-f+['r)  >m  +  hl(x.u)  <  x  >~m}, 

«ef/+  <  x  >m 

min{(l  —  A h))Vh(x  +  huf-(x))<  1  i > — 1_  hl(x,u)  <  x  >^rn}}. 

uef/_  <  x  >m 


It  suffices  to  show  (5.38)  has  a  unique  solution.  It’s  clear  that  the  operator  £/(•) 
maps  any  W  £  BC(Ml2)  into  BC( R2),  where  BC(R2)  denotes  the  set  of  bounded 
continuous  functions.  When  (5.37)  is  satisfied,  one  can  show  that  Q{-)  is  a  con¬ 
traction  mapping.  Hence  we  conclude  using  the  contraction  mapping  theorem. 
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The  following  theorem  asserts  that  14(0  converges  to  V(-)  as  h  — >■  0.  The  proof 
can  be  found  in  [9] (Chapter  VI) (with  minor  modification). 

Theorem  5.2.14  Assume  A\  and  A2,  A  >  2 L0,  and  (5.37).  Then 

sup  | Vh(x)  —  V(x)\  — *■  0  as  h  — ■>  0,  (5.39) 

for  every  compact  JC  Cl2. 

It  was  also  shown  in  [9]  that  one  can  obtain  a  sub-optimal  control  for  the 
continuous  time  problem  when  solving  the  DBE.  Theoretically  the  solution  to 
(5.36)  can  be  obtained  by  successive  approximation.  A  practical  approximation 
scheme  for  solving  the  DBE  is  described  in  [9]  (Appendix  A,  by  Falcone).  It 
was  shown  there  that  when  space  discretization  gets  finer  and  finer,  the  solution 
obtained  via  solving  a  finite  system  of  equations  converges  to  I4(-). 


5.3  Other  Control  Problems 


In  this  section,  we  briefly  discuss  how  to  extend  the  viscosity  solutions  approach 
to  other  control  problems  of  practical  interest.  Since  key  ideas  have  been  studied 
in  details  in  Section  5.2,  we  will  just  state  the  results  without  proof. 

In  some  optimal  control  problems  or  dynamical  games,  the  value  functions 
may  be  discontinuous.  To  handle  this  problem,  we  introduce  the  notion  of  non- 
continuous  viscosity  solutions. 

We  recall  that  a  function  /  :  E  C  Rn  — »  R  is  upper  (lower,  resp.)  semicontin- 
uous  if  for  any  x  €  E  and  e  >  0,  there  exists  6  >  0,  such  that  f(y )  <  f(x)  +  e 
(f(y)  >  f(x)  —  e,  resp.)  for  all  y  G  E  and  | y  —  x\  <  6.  An  upper  (lower,  resp.) 
semicontinuous  function  achieves  its  maximum  (minimum,  resp.)  value  on  any 
compact  subset  of  E. 
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Definition  5.3.1  (Semicontinuous  semisolutions)  [9]  Let  W  be  an  upper  (lower, 

resp.)  semicontinuous  function  from  an  open  set  O  C  Rra  into  R.  W  is  called  a  vis¬ 
cosity  subsolution  (. supersolution ,  resp.)  to  (5.21),  provided  \/<p  G  Cl{0),  if  11’  •  o 
attains  a  local  maximum.  ( minimum ,  resp.)  at  Xq  G  O,  then 

I T (./•„).  Di  >(./■(, ) )  <  0  (>  0,  resp.). 


Definition  5.3.2  (Semicontinuous  envelopes)  [9]  For  a  locally  bounded  func¬ 
tion  V  :  E  ci"->  R,  its  upper  semicontinuous  envelope  V*  is  defined  by 

V*(x)  =  limsup  V(y)  =  lim  sup{D(y)  :  y  G  E,  \y  —  x\  <  r}, 

y^x  '—0+ 

and  its  lower  semicontinuous  envelope  V’  is  defined  by 

Vfix)  =  lim  inf  V(y)  =  lim  inf{V(y)  :  y  G  E,  \y  —  .x’|  <  r}. 

y — x  '  r— 0+ 


It’s  easy  to  check  that  V*  is  upper  semicontinuous  and  F*  is  lower  semicontin¬ 
uous. 

Definition  5.3.3  (Non-continuous  viscosity  solutions)  A  locally  bounded  func¬ 
tion  V  is  a  non-continuous  viscosity  solution  of  (5.21)  if  V*  is  a  subsolution  of 
(5.21)  and  F*  is  a  supersolution  of  (5.21)  according  to  Definition  5.3.1. 

We  note  that  Definition  5.3.3  coincides  with  Definition  5.2.6  if  the  function  is 
continuous.  Hence  from  now  on,  whenever  we  say  a  viscosity  solution,  it  should 
be  understood  in  the  sense  of  Definition  5.3.3. 
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5.3.1  The  finite  time  horizon  optimal  control  problem 

We  define  the  cost  functional  for  the  finite  time  horizon  problem:  starting  from  x 
at  time  t, 

J(x,t,a(-))  —  J  l(x(r),u(T),T)e-XTdT  +  g(x(T)),  (5.40) 

where  T  >  0,  0  <  t  <  T,  x  G  M",  A  >  0,  and  g  is  the  terminal  cost.  The  value 
function  V (x,  t)  is  defined  as 

V(x.t)  =  inf  J(x,t,  «(•))• 

«(•) 

We  assume: 

•  (A3):  /(•,•,•)  is  continuous,  /  >  0: 

•  (A4):  | l(xi,u,t)  -  l(x 2,  u,t) |  <  Ci ( 1  +  |ari|  +  |ar2|)|a:i  -  x2\,  V«,Vari,a;2,Vi  G 
[0,T],  for  some  C/  >  0; 

•  (A5):  \g(xi)  -g{x 2)|  <  Ce(  1  +  |;r:l  |  +  |ar2|)|aJi  ^  ar2|, Vx1}x2. 

Example  5.3.4  A  trajectory  tracking  problem  on  a  finite  interval  can  be  formu¬ 
lated  as  above  with  A  =  0,  g  =  0,  and 

l(x,  u{t) ,  t)  =  q(t)(x2  -  X‘2 (t))2  + 

where  q(t )  >  0,  Vt,  .ro  denotes  the  M  component  of  x  and  x2(-)  is  a  bounded, 
desired  trajectory  of  M. 

We  can  show  the  value  function  has  some  nice  properties. 

Proposition  5.3.5  Under  assumptions  (A3)  —  (A5),  V{x,t)  is  locally  bounded. 
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Proposition  5.3.6  Under  assumptions  (A3)  —  (A5) ,  V{x,t)  is  locally  Lipshitz  con¬ 
tinuous,  i.e.,  V  R  >  0, 

\V(xi,t)  ~  V(x2,s) |  <  Lr( \x-i  -x2\  +  1 1-  -s- 1 ) . V ./•  i . ./•-_>  G  B(0,R),  \/t,s  G  [0,T], 

(5.41) 

and  LR  can  be  written  as  Lr  =  C(1  +  R)  for  some  C  >  0. 

Therefore  V  belongs  to  the  following  class  of  functions  on  R2  x  (0,7’): 

P,(R2  x  (0,7’))  =  {TF(-,  •)  :  \W(xi,t)  -  W(x2,s)\  <  C(  1  +  i2)(|an  -  x2\  +  \t  -  ,s|), 

Uxi,x2  G  B(0,  R),\/  R  >  0,  V  t,  s  G  (0,7’),  for  some  C  >  0}. 

We  can  prove: 

Proposition  5.3.7  The  value  function  V  is  a  viscosity  solution  of  the  following 
evolutive  H.JB  equation: 

- Wt{x ,  t)  +  A W(x,  t.)  +  H(x ,  DW(x,  t)J  )  =  0,  x  G  R2,  t  G  (0, 7’),  (5.42) 

W(x,T)  =  g(x),  x  G  R2,  (5.43) 

where  the  Hamiltonian 

H(x,p,t )  =  max{max{—  uf+(x)  ■  p  —  /(.r.  //./)}.  max{— uf-(x)  ■  p  —  /(rr,«,t)}}, 

uef-G  '  '  '  u€U- 

Wt  denotes  the  partial  derivative  with  respect  to  t,  and  DW  denotes  the  partial 
derivative  with  respect  to  x. 

The  Hamiltonian  in  (5.42)  enjoys  nice  regularity  properties,  similar  to  those  in 
Proposition  5.2.8.  Based  on  this,  we  can  prove: 

Proposition  5.3.8  If  the  HJB  equation  (5.f2)  with  the  terminal  condition  (5.43) 
has  a  viscosity  solution  in  P[  (R2  x  (0,7’)),  it  is  unique. 

The  proof  can  be  found  in  [47,  60]. 
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5.3.2  The  time-optimal  control  problem 

Time-optimal  control  is  important  in  applications,  like  micro-positioning.  Since 
only  the  M  component  of  x  is  related  to  the  displacement  output,  we  consider  the 
target  set  T  to  be 

T  =  {( H ,  M)  G  R2  :  Hmin  <H<  Hmax,  M  =  M0},  (5.44) 

where  M0  €  [—MS,MS\  is  the  magnetization  corresponding  to,  say,  the  desired 
displacement. 


Remark  5.3.9  The  constraint  H  G  [Hmin,  Hmax]  in  (5.44)  reflects  the  limitation 
on  the  input  current  of  the  actuator.  Also  without  this  constraint,  the  time-optimal 
control  problem  would  be  uninteresting:  since  ./)  >  0,  i  =  1,2,3,  in  the  model  (5.3), 
the  optimal  control  would  be  u  =  uc  if  M  <  A/0,  a  =  —  uc  if  M  >  M0  and  u  =  0  if 


For  any  control  pair  «(•),  x  G  R2,  define 


oo  if  {t  :  x(t)  G  T}  =  0 

4(«)  =  < 

min{t  :  x(t)  G  Tj  otherwise 


(5.45) 


Then  the  minimum-time  function,  the  value  function  for  the  time-optimal  problem, 
is  defined  as 


T(x)  =  inf  4(a).  (5.46) 

“(•) 

Let  8T  denote  the  boundary  of  T,  and  Tc  denote  the  complement  of  T  in  R2. 
In  the  case  of  (5.44),  c)T  —  T.  The  continuity  of  T(x)  is  closely  related  to  the 
small-time  controllability  on  T  [9].  Due  to  lack  of  controllability  in  (5.4),  we  are 
unable  to  establish  the  small-time  controllability  of  (5.4)  at  T,  and  therefore  we 
have  to  take  T(x)  to  be  a  non-continuous  (not  necessarily  continuous)  function. 
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It’s  often  more  convenient  to  study  the  Kruzkov  transform  of  T(x)  [9]: 


V{x) 


A 


1  —  e  if  T(x )  <  oo 
1  if  T(x)  =  oo 


(5.47) 


We  note  that  V ( x )  is  the  value  function  for  the  optimal  control  problem  with  cost 


functional 

rtx  (cfc) 

J (x,  «(•))  =  inf  /  e~fdt,  (5.48) 

“(•)  Jo 

and  the  optimal  control  for  (5.48)  coincides  with  the  time-optimal  control. 


One  can  show  that  V(x)  is  a  non-continuous  viscosity  solution  of 


W(x)  +  H(x,  DW(x))  =  0,  x  G  Tc, 
U '(•')  =  0,  x  G  <)T. 


(5.49) 


(5.50) 


where  the  Hamiltonian 


A 


H(x,p)  =  max{ max{  —  uf+  (x)  ■  p} ,  max {-uf- (x)  •  p}}  -  1. 

UEU+  U&U— 


It  is  very  hard  to  characterize  the  value  function  as  the  unique  solution  of  the 
Dirichlet  problem  (5. 49), (5. 50)  in  the  class  of  non-continuous  functions.  To  proceed 
toward  that  direction,  we  introduce  a  generalized  solution,  the  envelope  solution , 
of  the  Dirichlet  problem. 

Consider  the  Dirichlet  boundary  value  problem: 


j  F{x.  W(x),  DW(x ))  =  0,  x  £  O 
j  W(x)  =  g(x),  x  G  dO 

where  O  C  R"  is  open,  F  :  OxRx Rn  — ►  R  continuous,  g  :  dO 
the  closure  of  O.  We  say  that  a  bounded  upper  (lower,  resp.) 
function  W  :  O  — *■  R  is  a  subsolution  ( supersolution ,  resp.) 


(5.51) 


R.  Let  O  denote 
semicontinuous 
of  (5.51)  if  it  is 
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a  viscosity  subsolution  (supersolution,  resp.)  of  F{x,W{x),DW(x))  =  0  on  O 
(Definition  5.3.1)  and  is  <  g  (>  g ,  resp.)  on  30. 

Now  denote 

S  =  {subsolutions  of  (5.51)},  Z  =  {supersolutions  of  (5.51)}. 
Definition  5.3.10  (Envelope  solutions)  [9]  Let  W  :  O  —>  R  be  locally  bounded. 

1.  W  is  an  envelope  viscosity  subsolution  of  (5.51),  briefly,  e-subsolution,  if 
there  exists  <S(IT)  C  S,  S  0,  such  that 

W(x)  —  sup  w(x),  x  €  O'. 
wes(\v ) 

2.  W  is  an  e- super solution  of  (5.51).  if  there  exists  Z(W)  C  Z,  Z  ^  ft,  such 
that 

II  '../  )  =  inf  w(x),  x  G  O; 

weZ(W) 

3.  W  is  an  e-solution  of  (5.51)  if  it  is  an  e-subsolution  and  e- supersolution. 

We  can  show: 

Proposition  5.3.11  Vflx)  (recall  Definition  5.3.2)  is  the  unique  e-solution  of  the 
Dirichlet  problem  (5. 49),  (5. 50). 

The  proof  can  be  found  in  [9] . 

5.3.3  The  exit  problem 

It’s  natural  to  consider  problems  with  the  restricted  state  space  for  control  of  a 
magnetostrictive  actuator.  Let 

<->  =  {(//•  M)  G  K2  :  Hmin  <H<  -Ms  <  M  <  MJ, 
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and  let  12,  12c  be  its  closure  and  complement  in  R2,  respectively.  The  constraint 
on  H  has  been  explained  in  Subsection  5.3.2,  while  the  constraint  on  M  is  from 


the  physics. 

For  a  control  problem  with  state  space  constraint,  in  which  any  admissible 
control  has  to  keep  the  state  within  certain  domain  throughout  the  time  period  of 
interest,  the  value  function,  if  it’s  continuous,  is  a  constrained  viscosity  solution  of 
the  corresponding  HJB  equation  [73].  As  in  the  case  of  time-optimal  control,  the 
continuity  of  the  value  function  depends  on  the  controllability  of  the  system  at  the 


domain  boundary. 

In  this  subsection  we  study  an  exit  problem  for  the  model  (5.4).  For  any  x  G  R2, 
any  measurable  control  pair  «(•),  we  denote  by  tx(a)  the  first  exit  time  of  x(t)  from 
the  open  set  Cl.  Clearly,  if  x  G  1 2,  tx(a)  —  0  for  any  «(•).  The  cost  functional  is 


defined  as 


foAa)l{x{t.),u{t))e  tdt  +  e  g(x(tx(a)))  if  tx(a)  <  oo 

J(x,a(-))  =  < 

{  fo°  l(x(t),u(t))e~tdt  if  4(a)  =  00 

(5.52) 

where  for  simplicity,  we  have  let  the  discount  factor  A  =  1.  The  running  cost 


/  is  assumed  to  be  bounded,  continuous  with  respect  to  x  and  u,  and  Lipschitz 


continuous  with  respect  to  x.  The  terminal  cost  g  is  assumed  to  be  bounded, 
continuous,  and  satisfies  a  global  principle  of  sub-optimality: 

g(x)  <  inf{  /  l(x(s),u(s))e~sds  +  e-C/(a*(f))},  Va;  G  R2,  Vi  >  0.  (5.53) 

“(•)  Jo 

Eq.  (5.53)  plays  a  role  of  a  compatibility  condition. 


Remark  5.3.12  If  we  let  12  =  Tc ,  g  =  0,  /  =  1,  we  recover  the  time-optimal 
control  problem.  Hence  the  result  of  this  subsection  applies  to  the  time-optimal 
problem  as  well. 
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We  can’t  prove  the  continuity  of  the  value  function  V(x)  =  infa(.)  J(x,  «(•)), 
again  due  to  lack  of  controllability.  V (, x )  is  a  (lion-continuous)  viscosity  solution  of 
the  H.JB  equation.  As  discussed  in  Subsection  5.3.2,  we  need  some  additional  tools 
to  single  out  the  value  function  from  all  viscosity  solutions  of  the  HJB  equation. 
In  Subsection  5.3.2,  we  used  the  notion  of  e-solutions,  here  we  will  make  use  of 
another  concept,  the  bilateral  supersolutions  of  Dirichlet  problems. 

The  HJB  equation  associated  with  the  exit  problem  is 


j  W(x)  +  H(x,  DW(x ))  =  0,  x  G  9. 
I  W(x)  =  g(x ),  x  G  cKl 


(5.54) 


where  the  Hamiltonian 


H(x,p) 


A 


max{max{— uf+{x)  ■  p  —  l(.r.  u)},  max{—  uf-(x)  ■  p  —  l(x ,  u)}}- 
ueu+  weU- 


Consider  the  first  equation  in  (5.54): 


W(x)  +  Il(.r.DW(x))  =  0,  G  f2. 


(5.55) 


Definition  5.3.13  (Bilateral  supersolutions  of  (5.55))  [9]  A  lower  semicon- 
tinuous  function  W  :  Cl  — >•  R  is  a  bilateral  (non- continuous  viscosity)  supersolution 
of  (5.55)  if  it  is  both  a  supersolution  of  (5.55)  and 

—  W(x)  -  H(x,DW(x ))  =  0,  x  E  a 


Definition  5.3.14  (Bilateral  supersolutions  of  (5.54))  [9]  Given  a  lower  semi- 
continuous  function  g  :  R2  — >  R,  a  lower  semicontinuous  function  W  :  R2  — >  R 
is  a  bilateral  supersolution  of  (5.5 f)  if  it  is  a  bilateral  supersolution  of  (5.55), 

1 V  (x)  =  g(x),  Vx  G  f!c,  and  it  is  a  super  solution  of 

-W(x)  -  H(x,DW{x))  =  0,  x  G  K2. 
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The  following  result  is  adapted  from  [9]: 


Proposition  5.3.15  V*(x)  is  the  unique  bounded  bilateral  supersolution  of  (5.54). 


5.3.4  The  nonlinear  Tioo  control  problem 


In  this  subsection,  we  consider  the  nonlinear  7 control  problem.  For  that  we 
introduce  an  exogenous  disturbance  w  into  the  model  (5.4)  and  define  a  regulated 


output  z: 

|  x(t)  =  f(x(t),u(t),d(t))  +  g(x(t))w(t) 
I  z(t)  =  h(x(t),u{t)) 


(5.56) 


where  g  is  a  continuous  function  taking  values  in  R2xp,  p  >  0,  w  takes  values  in 


Wo  C  Rp,  z  takes  values  in  Rg,  q  >  0,  and  h  is  continuous.  We  will  assume  that 
M')  e  L‘foc(R,  Wo),  i.e., 


\w(t)\2dt  <  'Co,  V  I'  <  'Co,  and  w(t)  €  Wo,  Vt. 


Jo 

Definition  5.3.16  The  (state  feedback)  sub-optimal  Hoo  control  problem  with  dis¬ 
turbance  attenuation  level  7  >  0  is  solvable  if  there  is  a  state  feedback  controller 


K(-),  such  that: 

1.  (Dissipativity)  the  closed-loop  system  is  dissipative  with  level  7,  i.e.,  there 
exists  some  finite  function  U(x)  >  0  and  U{ 0)  =  0,  such  that  starting  from  x  G  R2, 

/  \z{t)\2  -  7 2\w(t)\2dt  <  U(x),  \/x,  VT  >  0,  Vu>(-)  G  L‘foc( R+,  W0).  (5.57) 


^0 

2.  (Stability)  the  closed-loop  system  is  stable  when  w(t)  =  0. 


A  general  theory  of  dissipative  systems  has  been  studied  by  Willems  [88] ,  where 
dissipativity  is  defined  in  terms  of  an  inequality  involving  the  storage  function  and 
the  supply  rate.  (Asymptotic)  stability  of  a  dissipative  system  can  be  obtained 
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with  a  further  assumption  on  detectability  of  the  system  [44],  James  [48]  has 
shown  that  a  system  is  dissipative  if  and  only  if  a  partial  differential  inequality 
(PDI)  admits  a  lower  semi  continuous  solution  in  the  viscosity  sense.  Van  der 
Schaft  made  use  of  the  dissipativity  theory  in  the  control  setting  and  derived 
the  Hamilton- Jacobi-Isaacs  (HJI)  equation  for  nonlinear  affine  systems  with  state 
feedback  [70]. 

The  connection  between  7 f  l0C.  control  and  differential  games  has  been  well-known 
[6,  10,  49].  The  value  function  of  a  differential  game  (when  it  exists)  is  the  viscosity 
solution  of  the  HJI  equation  under  very  general  assumptions  [31].  The  relationship 
between  Hr, 0  control  and  viscosity  solutions  of  the  appropriate  HJI  equations  has 
been  pursued  by  [59,  5,  75,  92,  90,  76],  to  name  a  few.  Here  we  mention  that 
in  particular,  Soravia  has  shown  that  the  Hoo  control  problem  is  solvable  if  and 
only  if  the  corresponding  HJI  equation  admits  a  nonnegative  lower  seinicontinuous 
supersolution,  which  is  null  and  continuous  at  the  origin  [76].  This  result  is  the 
parallel  of  that  in  [48]  for  dissipative  systems. 

The  differential  game  corresponding  to  the  problem  (5.57)  is 

Vy(x)  =  inf  sup  sup  f  \z(t)\2  —  ^2\w(t)\2dt,  (5.58) 

K  w(-)  T  Jo 

where  V7  is  called  the  lower  value  function  since  the  controller  has  advantage  over 
the  disturbance.  If  we  let  the  disturbance  have  advantage  over  the  controller,  the 
corresponding  V7  will  be  called  the  upper  value  function.  In  our  problem,  since  the 
control  pair  a  =  {d,  u }  and  the  disturbance  w  are  seperate  in  the  dynamics  and 
the  cost,  the  Isaacs  condition  is  satisfied  and  V7  =  V7. 

One  can  show  V7  satisfies  the  following  HJI  equation  in  the  viscosity  sense: 

H(x,DVy(x))  =  0,  (5.59) 
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where  the  Hamiltonian 


H(x,p)  =  inf  {—q(x)w  ■  p  +  72|u’|2}  +  max{max{— uf+(x)  ■  p  —  \h(x,  w)|2}, 
w ew  '  m  u  . 

ma x{—uf-(x)  ■  p  —  \h(x, w)|2}}. 

ueU- 

From  the  previous  discussion,  if  we  can  obtain  a  supersolution  of  (5.59)  which 
satisfies  certain  conditions,  then  the  sub-optimal  Hoo  problem  is  solvable.  We 
note  that  in  general,  (5.59)  has  many  supersolutions  and  one  does  not  seek  the 
uniqueness  of  the  solution.  Some  computational  techniques  for  solving  a  PDI 
in  the  viscosity  sense  can  Ire  found  in  James  and  Yuliar  [50].  Controller  synthesis 
based  on  a  (super) solution  of  the  HJI  equation  has  been  investigated  under  various 
assumptions  in,  e.g.,  [5.  75,  92.  90],  but  it  remains  an  open  problem  for  general 
cases. 


5.4  Optimal  Control  Based  on  the  Dynamic  Hys¬ 
teresis  Model 


In  this  section  we  briefly  discuss  how  to  extend  the  viscosity  solutions  approach  to 
optimal  control  problems  based  on  the  dynamic  hysteresis  model  (3.2). 

Consider  the  system  (3.2).  Given  the  initial  memory  curve  t/j  G  lk  with  cor¬ 
responding  H,  the  cost  functional  associated  with  a  control  input  /(•)  is  defined 

ctS 

/*0C 

./(//.  r./(-H  -  /  /{//(/)..!/(/). /(/))C  x,dL 

Jo 

where  is  the  solution  of  (3.2)  under  /(•)  with  the  initial  condition  b>, 

and  A  >  0.  The  value  function  is  defined  as 

\'(U.v)  =  inf  ./(  //.  r.  /(•))• 
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Remark  5.4.1  We  know  the  true  state  space  for  (3.2)  is  W.  In  the  definitions 
above ,  we  “ augment ”  the  state  space  by  including  the  H  component ,  which  will  be 
useful  in  applying  the  viscosity  solutions  approach.  Note  the  H  component  must-  be 
consistent  with  fi. 


From  our  analysis  in  Chapter  3,  {#(•),  M(-)}  is  bounded  if  /(•)  is  so.  Hence  it’s 
natural  to  make  the  following  assumptions  about  the  running  cost  l:  l  is  continuous, 


0  <  l(H,  AI.  I)  <  C0,  VH,  A /,  V/  <=  [-/0,  J0],  and 


■U  II  .  Mr  I)  -  l(H2,  Mo,  /)|  <  Cfi\Hv  -  H2\  +  | Ml  -  M2|),  V77:L,  H2,  Mu  M2, 


VI  €  [—Iq,  /q],  where  Co  >  0,Q  >  0  are  constants,  and  70  is  the  limit  on  the  input 


current  of  the  actuator. 


From  Proposition  3.3.9  and  the  assumptions  on  Z,  one  can  easily  show  that  V 
is  bounded  and  uniformly  continuous. 

In  [4] ,  the  author  considered  an  optimal  control  problem  for  a  controlled  system 
with  the  form: 

f  y(t)  =  f(v(t),z(t),u(t)) 

<  ,  (5.60) 

[  z(t)  =  r[y,if](t) 

where  F  is  the  Preisach  operator.  The  value  function  was  shown  to  be  the  unique 
bounded,  uniformly  continuous  solution  of  a  discontinuous,  infinite  dimensional 
HJB  equation  in  an  adapted  viscosity  sense.  In  [12],  the  authors  studied  optimal 
control  of  the  following  system: 


|  V(t)  =  f(t,y(t),u(t),z(t )) 
i  z(t)  =  \  '  II.  /.’](/ ) 


(5.61) 


and  proposed  a  new  type  of  HJB  equations,  where  one  of  the  arguments  is  the 
active  set  ( P+(t )  in  Figure  2.2). 
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The  approach  in  [4]  can  be  extended  to  our  problem  after  we  rewrite  (3.2)  as 


m 


_ Cl 

1  + 


H(t) 

Co 


)) 


m 


H(t)) 


(5.62) 


where  •)  is  as  defined  in  Subsection  3.4.1.  Although  ipt  appears  in  (5.62),  it 
does  not  cause  extra  difficulty  since  tp  is  an  argument  in  the  HJB  equation. 
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Chapter  6 


Conclusions 


This  dissertation  has  been  centered  around  modeling  and  control  of  hysteresis  in 
smart  actuators.  Extensive  simulation  and  experimental  work  based  on  a  com¬ 
mercial  magnetostrictive  actuator  have  been  conducted  to  validate  the  modeling 
approach  and  the  control  schemes. 

The  contribution  of  this  dissertation  in  the  modeling  aspect  is  the  proposal 
of  a  novel  dynamic  hysteresis  model,  consisting  of  a  Preisach  operator  coupled  to 
an  ODE.  We  have  established  the  well-posedness  of  the  model  from  two  different 
perspectives.  Apart  from  being  useful  for  the  control  purpose,  the  model  presents 
many  interesting  system-theoretic  problems  due  to  its  special  structure.  We  have 
studied  the  following  properties  of  the  model:  stability  of  the  equilibria,  input- 
output  stability,  reachability  and  observability.  We  have  also  looked  at  algorithms 
to  numerically  integrate  the  system.  In  addition,  the  existence  of  periodic  solutions 
under  periodic  forcing  has  been  proved.  This  helps  validate  the  model  and  provides 
a  theoretical  basis  for  an  identification  scheme. 

We  have  pursued  the  problem  of  hysteresis  control  along  three  different  but 
connected  paths:  inverse  control,  robust  control  and  optimal  control. 


152 


The  idea  of  inverse  compensation  is  to  construct  an  inverse  operator  to  cancel 
out  the  hysteretic  nonlinearity  in  smart  actuators.  This  is  done  for  the  Preisach 
operator  based  models.  We  have  presented  parameter  identification  methods  and 
proposed  several  efficient  inversion  schemes,  all  of  which  can  be  implemented  in 
real-time.  A  special  type  of  inversion  problem,  the  value  inversion  problem,  has 
also  been  formulated  and  solved. 

Inverse  control  is  open-loop  in  nature  and  its  performance  is  susceptible  to 
model  uncertainties  and  to  errors  introduced  in  the  inversion  process.  To  combat 
this  problem,  we  have  come  up  with  a  robust  control  framework  for  smart  actua¬ 
tors.  The  inversion  error  is  modeled  as  an  exogenous  disturbance  whose  magnitude 
is  quantifible,  and  then  robust  control  techniques  are  employed  to  attenuate  its 
impact.  We  have  also  been  able  to  incorporate  the  saturation  constraint  into  the 
controller  design. 

We  have  studied  optimal  control  problems  mainly  based  on  a  low  dimensional 
hysteresis  model.  We  have  adopted  the  dynamic  programming  approach  and  stud¬ 
ied  the  Hamilton- Jacobi-Bellman  equation  satisfied  by  the  value  function  in  the 
viscosity  sense. 

There  are  several  possible  directions  to  extend  the  work  reported  in  this  dis¬ 
sertation. 

We  observe  that  the  hysteretic  behavior  of  the  actuators  varies  slowly,  possi¬ 
bly  due  to  fluctuation  of  the  temperature.  An  interesting  research  direction  is  to 
include  this  variability  in  hysteresis  modeling,  for  example,  proposal  of  a  general¬ 
ized  Preisach  operator  having  extra  parameters  to  account  for  the  variability.  We 
note  that  a  time-dependent  Preisach  model  has  been  proposed  in  [27],  where  the 
thresholds  (/?,  a)  of  hysterons  are  allowed  to  be  time- varying. 
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An  alternative  approach  to  tackle  the  above  problem  would  be  adaptive  pa¬ 
rameter  identification  and  adaptive  inverse  control.  Although  this  idea  has  been 
pursued  for  several  other  (relatively  simple)  hysteretic  operators  [80,  57],  adaptive 
inverse  control  remains  an  open  problem  for  the  Preisach  operator. 

With  fast  development  of  the  micro-electro-mechanical  systems  (MEMS)  tech¬ 
nology,  many  applications  will  involve  thousands  of  smart  sensors,  actuators,  and 
processors,  where  information  processing  and  decision  making  should  be  done  in 
a  distributed  but  coordinated  way.  In  such  a  setting,  control  with  communication 
and  compuation  constraints  is  currently  an  active  research  area,  and  how  to  take 
into  account  the  hysteresis  in  sensors  and  actuators  becomes  an  important  and 
challenging  problem . 
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Appendix  A 


Elements  of  Functional  Analysis 


In  this  appendix,  we  review  some  basic  notions  and  results  of  functional  analysis 
which  have  been  used  in  the  development  of  this  dissertation.  In  particular,  we 
introduce  metric  spaces,  Banach  spaces  and  fixed  point  theorems.  The  material  in 
this  appendix  can  be  found  in,  e.g.,  [69,  93]. 

A.l  Metric  Spaces 

Definition  A. 1.1  (Metric  spaces)  A  metric  space  <  X,p  >  is  a  nonempty  set 
X  of  elements  together  with  a  real-valued  function  p  defined  on  X  x  X  such  that 
for  all  x,  y  and  z  in  X : 

1.  p(x,  y)  >  0, 

2.  p(x,  y)  —  0  if  and  only  if  x  =  y , 

3.  p(x,  y)  —  p{y,  x),  and 

4.  p(x,  y)  <  p(x,  z)  +  p(z,  y). 

The  function  p  is  called  a  metric. 
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For  a  metric  space  <  X,p  >,  O  C  X  is  called  open  if,  Vx  G  O,  35  >  0  such 
that  {y  G  X  :  p(x,y )  <  <5}  C  O.  A  point  x  G  X  is  called  a  point,  of  closure  of  the 
set  E  C  X  if,  V<5  >  0,  By  G  E,  such  that  p(x ,  y)  <  A  The  set  of  points  of  closure 
of  E  is  denoted  as  E.  A  set  E  is  called  closed,  if  E  —  E.  If  E  —  X,  E  is  said  to  be 
dense  in  X.  A  metric  space  <  X,  p  >  is  called  seperable  if  it  has  a  subset  D  which 
has  a  countable  number  of  points  and  is  dense  in  X. 

A  function  /  on  a  metric  space  <  X,  px  >  into  a  metric  space  <  Y,  pY  >  is 
said  to  be  continuous  at  x  if,  Ve  >  0,  >  0  such  that  if  px(x,z )  <  5,  then 

pY  (/ (x) ,  / (z))  <  e.  It  is  called  continuous  if  it  is  continuous  at  every  x  G  X. 
f  is  called  injective  if  it  is  one-to-one,  and  is  called  surjective  if  it  is  onto,  i.e. , 
f{X)  =  {z  G  Y  :  z  =  f(x)  for  some  x  G  A"}  =  Y. 

A  sequence  {xn}  from  a  metric  space  <  X,p  >  converges  to  x  G  X  if  given 
e  >  0,  there  is  an  N  such  that  p(x,xn)  <  e  for  all  n  >  N.  The  point  x  is  called 
the  limit  of  {, xn },  and  we  write  xn  — *•  x.  We  call  x  a  cluster  point  of  {xn}  if  a 
subsequence  of  {xn}  converges  to  x. 

A  sequence  {, xn }  from  a  metric  space  <  X,p  >  is  called  a  Cauchy  sequence ,  if 
given  e  >  0,  there  is  an  N,  such  that  p(xn,xm)  <  e  for  all  n,  m  >  N.  A  convergent 
sequence  is  a  Cauchy  sequence,  but  the  converse  is  not  generally  true.  If  the  metric 
space  has  the  property  that  every  Cauchy  sequence  converges,  we  say  that  space 
is  complete. 

A  collection  U  of  open  sets  in  a  metric  space  is  an  open  covering  of  a  set  E  if 
E  is  contained  in  the  union  of  the  sets  in  U.  A  set  E  is  said  to  be  compact,  if  every 
open  covering  U  of  E  has  a  finite  subcovering,  i.e.,  if  there  is  a  finite  collection 
{Oi,  0-2,  ■  ■  ■  ,  On}  C  U  such  that  E  C  U;A,  O*. 
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We  say  that  a  set  E  in  a  metric  space  <  X,  p  >  is  relatively  sequentially 
compact  if  any  sequence  {xn}  in  E  has  a  convergent  subsequence  xnk  ->3:6  X. 

E  is  sequentially  compact  if  any  sequence  {xn\  in  E  has  a  convergent  subsequence 
Xnk  ->  X  E  E. 

For  a  metric  space,  the  notions  of  compactness  and  sequential  compactness  are 
equivalent. 

A  family  T  of  functions  from  a.  metric  space  <  X ,  px  >  to  a  metric  space 
<  Y,py  >  is  called  equicontinuous  at  x  E  X  if,  Ve  >  0,  3  an  open  set  O  containing 
x  such  that  pY'(f(x),f(z))  <  e  for  all  z  E  O  and  /  E  T.  The  family  is  said  to  be 
equicontinuous  on  X  if  it  is  equicontinuous  at  each  point  x  E  X . 

Theorem  A. 1.2  (The  Ascoli-Arzela  theorem)  Let  T  bean  equicontinuous  fam¬ 
ily  of  functions  from  a  seperable  space  <  X,  px  >  to  a  metric  space  <  Y,py  >■ 
Let  {/n}  be  a  sequence  in  T  such  that  for  each,  x  E  X ,  the  closure  of  the  set 
{fn(x)  :  n  >  0}  is  compact.  Then  there  is  a  subsequence  {fnk}  that,  converges 
pointwise  to  a  continuous  function  f ,  and  the  convergence  is  uniform  on  each 
compact,  subset  of  X . 

Corollary  A.  1.3  Let,  T  be  an  equicontinuous  family  of  real-valued  functions  on  a 
seperable  space  X .  Then  each  sequence  {/„}  in  T  which  is  bounded  at,  each  point, 
has  a  subsequence  {fnk}  that  converges  pointwise  to  a  continuous  function,  and  the 
convergence  is  uniform  on  each  compact,  subset  of  X . 

A. 2  Banach  Spaces 

In  the  following  K  =  R  or  K  =  C,  where  R  denotes  the  set  of  real  numbers  and  C 
denotes  the  set  of  complex  numbers. 
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Definition  A. 2.1  (Vector  spaces)  A  set  X  of  elements  is  called  a  vector  space 
for  linear  space)  over  IK  if  we  have  a  function  +  on  X  x  X  to  X  and  a  function  ■ 
on  IK  x  A  to  X  that  satisfy  the  following  conditions:  for  airy  x,y  E  X,  a,  (3  E  K, 

1.  x  +  y  =  y  +  x, 

2 .  (x  +  y)  +  z  =  x  +  (y  +  z), 

3.3  d  E  X,  such  that  x  +  9  =  x,  V:c  E  X, 

4.  a  ■  (x  +  y)  =  a  •  x  +  a-  y, 

5.  (o  +  ,6)  ■  x  =  a  ■  x  +  (3  ■  x, 

6.  a  ■  ((3  ■  x)  =  (a/3)  ■  x, 

7. 0  •  x  =  9, 1  ■  x  —  x. 

For  a  vector  space  X  over  K,  the  elements  u  \ ,  u>.  •  •  •  ,  un  of  X  are  called  linearly 
independent  if  and  only  if 


+  Q2n_)  +  •  •  •  +  o  —  0,  on  E  K,  1  ^  i,  ^  N, 

implies  Oj  =  0,  1  <  i  <  N.  The  maximal  number  N  of  linearly  independent 
elements  in  X  is  called  the  dimension  of  X.  We  say  X  is  finite  dimensional  if 
N  <  oo,  and  X  is  infinite  dimensional  otherwise. 

Definition  A. 2. 2  (Norms)  A  nonnegative  real-valued  function  ||  •  ||  defined  on 
a  vector  space  is  called  a  norm,  if  for  x,  y  EX,  a  E  K, 

1.  ||  x  J|;=  0  if  and  only  if  x  =  9, 

2.  ||  x  +  y  ||  <||  x  j:|  +  ||  y  ||, 

3.  ||  ax  ||=  |ct |  ||  x  ||  . 
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A  normed  vector  space  becomes  a  metric  space  if  we  define  a  metric  p  by 
p(x,y )  =||  x  —  y  || .  If  a  normed  vector  space  is  complete  in  this  metric,  it  is  called 
a  Banach  space. 

A  point  x  is  called  a  fixed  point  of  a  mapping  /  if  f{x)  =  x. 

Theorem  A. 2. 3  (The  Banach  fixed  point  theorem)  Let  E  be  a  closed  nonempty 
subset  of  a  Banach  space  X .  Let  the  mapping  f  :  E  — >  E  be  k- contractive,  i.e.,  for 
all  x,  y  G  E, 

II  fix)  -  f(y)  ||  <  k  ||  x  -  y  ||, 

where  0  <  k  <  1.  Then  there  exists  a  unique  x*  G  E  satisfying  f(x*)  =  x* ,  and 
starting  from  any  x0  G  E,  we  have  xn  — >•  x* ,  inhere 

Xn  =  -l  )- V //  >  1. 

The  Banach  fixed  point  theorem  is  also  known  as  the  contraction  mapping 
theorem. 

A  subset  E  of  a  vector  space  X  is  called  convex  if  ax  +  (1  —  a)y  G  E  for  all 
x,y  G  E,  0  <  a  <  1. 

Theorem  A. 2. 4  (The  Brouwer  fixed  point  theorem)  Let  E  be  a  compact, 
convex,  nonempty  subset  of  a  finite  dimensional  normed  vector  space  X .  Let  the 
mapping  f  :  E  — >  E  be  continuous.  Then  f  has  a  fixed  point  in  E. 

Let  X  and  Y  be  normed  spaces  over  K.  The  mapping  /  :  E  C  X  — ►  Y  is  called 
compact  if  /  is  continuous  and  it  maps  bounded  sets  into  relatively  compact  sets. 

The  Brouwer  fixed  point  theorem  lias  been  generalized  to  the  setting  of  a  Ba¬ 
nach  space: 
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Theorem  A. 2. 5  (The  Scliauder  fixed  point  theorem)  Let  E  be  a  bounded, 
closed,  convex,  nonempty  subset  of  a  Banach  space  X .  Let  the  mapping  f  :  E  — >  E 
be  compact.  Then  f  has  a  fixed  point  in  E. 
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Appendix  B 


Measure  and  Integration 


The  material  in  this  appendix  can  be  found  in,  e.g.,  [69]. 

B.l  Measure 

A  a -algebra  srf  is  a  family  of  subsets  of  a  given  set  X  which  contains  0  and  is 
closed  with  respect  to  complements  and  with  respect  to  countable  unions.  A  set 
function  v  is  a  function  which  assigns  an  extended  real  number  to  certain  sets. 

Definition  B.1.1  (Measurable  space)  A  measurable  space  is  a  couple 
consisting  of  a  set  X  and  a  o -algebra  s/  of  subsets  of  X .  A  subset  A  of  X  is  called 
measurable  (with,  respect  to  )  if  A  <E  srf . 

Definition  B.l. 2  (Measure)  A  measure  u  on  a  measurable  space  (X.  .(/J)  is  a 
nonnegative  set  function  defined  for  all  members  of  srf  and  satisfying  z/(0)  =  0  and 

K0*w  =  f>«), 

i= 1  i=  1 

for  any  sequence  {Ej}  of  disjoint  measurable  sets.  By  a  measure  space  (X,se/,u), 
we  mean  a  measurable  space  ( .V .  .'■  / )  together  with  a  measure  u  defined  on  srf . 
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A  measure  v  is  called  finite  if  u(X)  <  oo.  It  is  called  a -finite  if  there  is  a 


sequence  { Xn }  of  sets  in  sA  such  that 


X=\J  Xn, 

n=l 


and  y{Xn)  <  oo. 


A  measure  space  ( X ,  u)  is  complete  if  srf  contains  all  subsets  of  sets  of 


measure  zero. 


Proposition  B.1.3  If  (X,  &/,  u)  is  a  measure  space ,  then  we  can  find  a  complete 
measure  space  {X.sAq.vq)  such  that 

1.  Ac  M o, 

2.  E  G  A=>  y{E)  =  uq(E),  and 

3.  E  G  Aq  E  =  A  U  B  where  B  e  sA  and  A  C  C,C  E  &/,  v(C)  =  0. 


The  measure  space  {X,srfo,Vo)  given  in  Proposition  B.1.3  is  called  the  completion 
of  (X,  &/,  u). 


For  a  metric  space  <  X,  p  >,  the  Borel  algebra  38  is  the  smallest  cr-algebra 


containing  all  the  closed  subsets  of  X ,  and  any  member  of  38  is  called  a  Borel  set. 


A  Borel  measure  v  is  a  measure  defined  on  the  Borel  algebra  38  or  the  completion 
of  such  a  measure.  We  assume  that  a  Borel  measure  has  finite  values  on  compact 
sets. 

For  the  space  RA\  the  unique  Borel  measure  that  assigns  the  standard  volume 
n,'=i  (pi  —  ai )  to  every  rectangular  cube 


ai,  bi]  x  •  •  •  x  [a,v,  6,v], 
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is  called  the  Lebesgue  measure  . 

Let  ( X ,  srf)  be  a  fixed  measurable  space.  Two  measures  vx  and  u-2  are  called 
mutually  singular  if  there  are  disjoint  sets  A  and  B  in  sA  such  that  X  —  AUB  and 
vx(A)  =  V‘2 (B)  =  0.  A  measure  ux  is  said  to  be  absolutely  continuous  with  respect 
to  the  measure  v2  if  (A)  =  0  for  each  set  A  satisfying  u2(A)  —  0. 

Definition  B.1.4  (Signed  measure)  A  signed  measure  on  the  measurable  space 
{X.sA)  is  an  extended  real-valued  set  function  v  defined  for  the  members  of  sA  and 
satisfying  the  following  conditions: 

1.  v  assumes  at  most  one  of  the  values  oo,  — oo, 

2.  v{flf)  =  0,  and 

3.  for  any  sequence  {£)}  of  disjoint  measurable  sets 

OO  OO 

Z'dJ  Ei)  =  u(Ei)i 

i=i  i=l 

where  the  equality  is  taken  to  mean  that  the  series  on  the  right  converges 
absolutely  ifu{{jEj)  is  finite  and  that  it  properly  diverges  otherwise. 

Proposition  B.1.5  (Jordan  decomposition)  A  signed  measure  u  on  the  mea¬ 
surable  space  ( X ,  s/)  can  be  uniquely  decomposed  as  u  =  u+  —  u~ ,  where  v+  and 
u~  are  mutually  singular  measures  on  (X,x/). 

The  measure  \u\  defined  by  \u\  =  u+  +  is  called  the  absolute  value  of  u. 


B.2  Integration 


For  a  measurable  space  (X,  &/),  a  function  /  :  X  — >  [—00,00]  is  called  measurable 


if 


f-\U)  =  {x  G  X  :  f{x)  eU}estf, 
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for  any  open  subset  U  of  [—00,  00],  and  /  is  called  Borel  measurable  if  the  srf  is  the 
Borel  algebra  S3.  A  function  6  is  called  simple  if  it  is  a  finite  linear  combination 


"(•'■)  =  <-i\E,  (-I'). 


(B.l) 


i--i 


of  characteristic  functions  of  measurable  sets  E). 

For  a  measure  space  ( X ,  &/,  o),  if  E  is  a  measurable  set  and  <fi  is  a  nonnegative 
simple  function  of  the  form  (B.l),  we  define 

N 
A 


cj)du  —  Ciu(Ei  n  E). 


i= 1 


For  a  nonnegative  measurable  function  /  and  a  measurable  set  E,  we  define 


fdu  £ 


sup  /  (j)dv. 
(p  simple  .0  :/  J E 


A  measurable  function  /  :  X  — ►  [—’00,00]  is  integrable  on  X  with  respect  to  u, 
if  fx  \f\do  is  finite.  For  an  integrable  function  /,  for  a  measurable  set  E,  we  define 


fdu^f+du-  I  f_dv, 
where  /+  =  max{/,  0}  and  /_  =  max{— /,  0}. 


J  E  JE 

^  r  r>  r\ 't  i  a  ^ 


Theorem  B.2.1  (The  Radon-Nikodym  theorem)  Let  (X,  &/,  of)  be  a  a -finite 
measure  space,  and  let  o2  be  a  measure  defined  on  srf  which  is  absolutely  continuous 
with  respect  to  o} .  Then  there  is  a  nonnegative  measurable  function  p  such  that 


02(E)  =  /  pdoi,  ME  E  s3 . 

IE 
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Appendix  C 


Basics  of  Robust  Control 


In  this  appendix  we  collect  some  fundamental  results  of  robust  control  from  [29]. 
Following  [29],  we  carry  out  the  discussions  in  the  discrete-time  setting.  For  other 
references  on  linear  robust  control,  in  particular,  Tioo  control,  please  see  [33,  39,  94], 
A  dynamic  game  approach  for  74,x  control  can  be  found  in  [10].  For  a  treatment 
on  nonlinear  74oo  control,  please  refer  to  [43]. 


C.l  Signals  and  Systems 

Denote  Z+  the  set  of  nonnegative  integers.  Denote  Zn(Z+)  the  space  of  all  vector¬ 
valued  real  sequences  on  Z+,  of  dimension  n,  i.e.,  Va;  =  {.r[/,:]}^t0  G  ln( Z+),  x[k]  G 
Rn,  k  =  0, 1,  •  •  • .  For  an  integer  1  <  p  <  oo,  we  define  the  space 

i;  =  {x  G  /"( Z+)  :||  a;  ||p<  .oo}, 

where 

II x  Ip=  EJ]l  xAk\\p)» 

k= 0  i  1 
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and  Xi[k]  denotes  the  i-th  component  of  x[k\.  For  instance,  If  is  the  space  of  finite 
energy  signals.  For  p  —  oo,  we  define  to  be  the  space  of  bounded  magnitude 
signals: 

=  {x  G  Zn(Z+)  :  sup  max  | Xi  [k]  \  <  oo}. 

k  i 

A  system  T  is  an  operator  between  two  signal  spaces  X  and  Y.  Denote  by 
Vk,  k  G  Z+,  the  truncation  operator  on  T{ Zn),  i.e., 

Vk(x{0\, x[l],  ■  ■  ■ )  =  (x[0],x[l],  •  •  •  ,  x[k],  0, 0,  •  •  • ). 

Denote  by  S  the  unit  shift  operator,  i.e., 

5(ar[0],ar[l],---)  =  (0, ar[l], x[2],  ■  ■  ■ ). 

Definition  C.1.1  (Linearity,  causality,  and  time-invariance)  An  operatorT  : 

X  — ►  Y  is  linear  if 

T {ax  +  Ay)  =  cYT (x)  +  @T(y),  Vo,  (3  G  R,  Wx,  y  G  X. 

An  operator  T  is  causal  if  for  all  k .  VkT  =  VkTVk,  and  is  strictly  causal  if 
VkT  =  VkTVk- 1  for  all  k.  T  is  time-invariant,  if  ST  =  TS . 


Definition  C.1.2  (Stability)  Let  X ,  Y  be  two  normed  linear  spaces.  An  opera¬ 
tor  T  :  A'  —  Y  is  stable  if 


I!  ^  I!  A  II  Tx  || y  ^ 

II  1  ||=  Slip  — - - -  <  'OO, 

x^O  II  x  \\x 

where  ||  •  \\x  and,  ||  •  ||i-  denote  the  norms  on  X  andY ,  respectively, 
the  induced,  norm  ofT. 


We  call  ||  T 


We  now  characterize  classes  of  linear  time-invariant  (LTI)  systems  on  Iff,  and 
If.  Causality  is  implicitly  assumed  for  a  LTI  system. 
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Let  7 Z  be  a  LTI  system  from  Zn(Z+)  to  Zm(Z+).  Then  for  any  x  G  ln( Z+), 


y  =  TZx  G  Zm(Z+)  can  be  expressed  as 

k 

y[k }  =  R[k  —  i]x\i],  k  —  0, 1,  •  •  •  , 

i=0 

where  /?,  =  }^L0  G  Zmxn(Z+)  is  called  the  impulse  response  of  1Z.  Write  the 

(bj)-th  component  of  R[k]  as  R,,j  [A:] ,  then  R{j  G  /'( Z+),  1  <  i  <  m,  1  <  j  <  n. 
From  now  on  we  will  denote  1Z  by  its  impulse  response  i?. 

Theorem  C.1.3  A  LTI  system  R.  is  stable  from  1 %  to  l™  if  and  only  if 

n 

II  R  II  1=  niMC  II  Rv  II  L<  '30,  (C.l) 

!<z<m  * — J 
3  = 1 

and  ||  R  ||  i  is  the  induced  norm  of  the  system. 

Remark  C.l. 4  Recall  that  a  matrix  A  =  (Ajj)  G  Rwxn  is  a  mapping  from  R* 
to  Rw.  When  we  equip  the  vector  spaces  R"  and  Rm  with  the  \  •  |oo  norms,  the 
induced  norm  of  A  is  the  so  called  l  \  norm  |A|j  =  inax:i<j<m  ^"=1  \Aij\.  Now  write 
R  =  ( Rij )  with  Rjj  =||  Rjj  ||:L.  Then  ||  R  ||i  defined  in  (C.l)  is  just  composition 
of  the  1 1  norm  of  R  with  the  R  norms  of  R^’s.  This  is  the  reason  we  use  notation 
II  •  Hi.  for  R. 


We  will  denote  by  R  the  space  of  LTI  systems  with  finite  Zoo-induced  norm. 
For  R  G  Zmxn(Z+),  we  define  the  A-transform  of  R: 

OO 

m  A)  =  £fl[fc]A*. 

k= 0 

Note  the  A-transform  is  connected  to  the  ^-transform  by  z  =  A  remark  on 
notation:  for  a  LTI  system,  say  G.  we  will  also  use  G  to  denote  its  impulse  response 
and  will  use  G  to  denote  its  A-transform. 
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Figure  C.l:  Feedback  connection. 

Denote  by  TCffn  the  space  of  complex  functions  that  are  analytic  on  the  open 
unit  disc  and  bounded  on  the  unit  circle,  and  denote  by  7 ZH™xn  the  space  of  real 
rational  functions  inside  H™xn.  For  any  R  €  the  TL^  norm  is  defined  as 

II  R  II Woe  =  sup amax[R(eje)], 

0 

where  crmax  denotes  the  largest  singular  value  of  the  matrix. 

Theorem  C.l. 5  A  LTI  system  R  is  stable  from  1%  to  l™  if  and  only  if 

II  R  II Woo  <  oo. 

We  write  ||  R  11,00=11  R  ||Wxj. 

The  following  result  is  at  the  heart  of  the  robust  control  theory  and  it  applies 
to  general  nonlinear  time-varying  systems.  Consider  the  feedback  connection  of 
two  systems  G%  and  G2  as  shown  in  Figure  C.l.  The  closed-loop  system  is  said  to 
be  well  posed  if  for  any  ip,  u2,  there  exists  a  unique  solution  y L,  y2. 

Theorem  C.l. 6  (Small  gain  theorem)  Let  G \  :  If  — >  If1  and  G2  :  l™  — >  If  be 

two  lp-stable  systems  and  assume  that  the  closed-loop  system  is  well  posed.  Then 
the  closed-loop  system  is  lp- stable  (taking  ip,  u2  as  the  input ,  y\,y2  as  the  output) 
if  ||  Gi  ||  •  ||  G2  ||  <  1. 
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(a)  (b) 


Figure  C.2:  General  setup. 

C.2  Parametrization  of  Stabilizing  Controllers  and 
Achievable  Closed-Loop  Maps 


Figure  C.2(a)  shows  a  general  setup  for  formulating  performance  objectives  (tra¬ 
jectory  tracking,  disturbance  attenuation,  etc.),  where  u  is  the  controlled  input,  y 
is  the  measured  output,  w  is  the  exogenous  input  and  z  is  the  regulated  output. 
The  operator  G  is  a  2  x  2  block  matrix  mapping  w  and  u  to  2  and  y: 


G 1 1 

G 1 2 

(  \ 

w 

vG 

G21 

G-2‘2 

V  w  / 

K  is  the  feedback  controller.  As  we  will  see  shortly,  to  solve  the  robust  control 
problem  (either  robust  stability  problem,  or  robust  performance  problem,  or  both), 
one  always  ends  up  with  the  problem  of  finding  a  stabilizing  K  to  minimize  the 
induced  norm  of  the  closed-loop  map  from  w  to  2. 

Consider  Figure  C.2(b).  Let  H(G,K )  denote  the  following  map: 


/C 


u 


W 


=  H(G,  K) 


(  ,  \ 


Vi 

V  ) 


We  assume  the  map  H(G,  K)  is  well  posed. 
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Definition  C.2.1  The  closed-loop  system  is  lp-stable  if  the  lp-induced  norm  of 
H(G,K)  is  finite.  In  such  a  case,  I\  is  said  to  be  stabilizing  in  the  lp  sense. 

We  will  be  interested  in  the  map  <t>  between  w  and  z: 

$  =  Gn  +  Gl2K(I  -  G22K)~lG2l.  (C.2) 

In  literature,  $  is  called  the  lower  Linear  Fractional  Transformation  (LFT)  of  G 
and  K  and  it  can  be  written  as  $  =  Ff  G,  K). 

We  now  restrict  ourselves  to  the  case  that  G  is  a  LTI  system.  We  make  the 
assumption  that  all  the  unstable  poles  of  G  are  reachable  from  u  and  observable 
from  y.  With  this  assumption,  K  stabilizes  G  if  and  only  if  it  stabilizes  G 22  and 
it  suffices  to  parametrize  all  stabilizing  controllers  for  G 22. 


Definition  C.2. 2  (Doubly-coprhne  factorization)  A  doubly- coprime  factor¬ 


ization  of  G22  is  a  set  of  maps  N,  M.  N ,  A L  with  G 22  =  NM  =  M  lN  satisfying 


( 


X  -Y 
-N  M 


\ 


J 


( 


V 


M  Y 
N  X 


=  L 


(C.3) 


for  some  stable  X ,  Y,  X  and  Y,  where  I  is  the  identity  map.  M  and  N  are  called 
the  right  coprime  factors  of  G22  while  M  and  N  are  called  the  left  coprime  factors 
of  G22. 


Theorem  C.2. 3  Let  a  doubly- coprime  factorization  of  G22  be  given  as  in  (C.3). 
All  stabilizing  controllers  are  given  by 

K  =  (Y  -  MQ)(X  -  NQ)~l  =  (X  -  QN)-l{Y  -  QM ),  Q  is  stable.  (C.4) 


170 


Remark  C.2.4  Theorem  C.2.3  gives  a  complete  pammetrization  of  stabilizing 
controllers  in  the  following  two  senses: 

1.  If  Q  is  loo-stable ,  K  is  stabilizing  in  the  loo.  sense;  if  Q  is  l2- stable,  K  is 
stabilizing  in  the  l2  sense. 

2.  The  pammetrization  covers  cases  of  LTI  controllers,  linear  time-varying  (LTV) 
controllers  and  nonlinear  controllers.  For  instance ,  all  LTI  stabilizing  con¬ 
trollers  in  the  loo  sense  are  parametrized  by  Q  G  l\,  all  finite-dimensional 
LTI  (FDLTI)  stabilizing  controllers  in  the  loo.  sense  are  parametrized  by 
Q  G  TTHoo,  all  LTV  stabilizing  controllers  in  the  lQ 0  sense  are  parametrized 
by  Q  in  the  set  of  all  LTV  loo-stable  operators,  and  all  nonlinear  time-varying 
stabilizing  controllers  in  the  sense  are  parametrized  by  Q  in  the  set  of  all 
nonlinear  time-varying  loo-stable  operators. 

Corollary  C.2.5  If  G-n  is  stable,  then  the  pammetrization  of  stabilizing  con¬ 
trollers  is  given  by 

K  =  -Q(I  -  G22Qrl.  (C.5) 


Proof  When  G22  is  stable,  we  obtain  a  doubly-coprime  factorization  by  letting 
M  =  X  =  /,  M  =  X  =  /,  Ar  =  N  =  G 22  and  Y  =  Y  =  0,  where  the  dimensions 
of  the  identity  matrices  /  and  the  zero  matrix  0  should  be  compatible  with  the 
operators.  ■ 

Substituting  (C.4)  into  (C.2),  we  get  a  parametrization  of  achievable  closed- 
loop  maps  from  w  to  z: 

$  =  E  -  UQV,  Q  is  stable,  (C.6) 
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Figure  C.3:  Robust  stability  analysis. 

where  E  =  Gn  +  G12YMG21,  U  —  G12M,  and  V  —  MGoj  .  We  note  that  the 
closed-loop  map  is  affine  in  the  free  parameter  Q ,  and  as  we  will  see,  this  plays 
a  fundamental  role  in  developing  synthesis  techniques  for  robust  control. 

C.3  Stability  and  Performance  Robustness 

In  this  section,  we  give  the  sufficient  and  necessary  condition  for  a  closed-loop 
system  to  be  stable  in  the  presence  of  uncertainties.  We  also  indicate  how  to  ad¬ 
dress  the  performance  robustness  problem  by  converting  it  to  a  stability  robustness 
problem.  Here  by  “performance”  we  mean  the  induced  norm  of  the  mapping  from 
the  exogenous  input  w  to  the  regulated  output  z. 

Consider  Figure  C.3,  where  A  represents  the  uncertainty  block.  Let  A  denote 
the  class  of  uncertainties.  A  can  have  certain  structure,  e.g.,  a  block  diagonal 
structure,  as  a  result  of  our  knowledge  about  uncertainty  locations.  The  structural 
information  of  A  is  exploited  to  reduce  the  conservativeness  in  robust  controller 
synthesis. 

Let  M  =  Fi(G,K). 

Definition  C.3.1  (Structured  norm)  Given  the  class  A  of  uncertainties  and 
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j),  the  structured  norm,  (SN)  of  M  is  defined  as 


snajm) 


A 


1 


infAeA-QI  A  || ip-ind'  (I  ~  M/S)-1  is  not  lp-stable}  ' 
where  ||  •  \\i  ind  denotes  the  lp-induced  norm.  If  for  any  A  G  A.  (/  —  M A) 
lp-stable ,  then  SNA^P(M)  is  defined  to  be  0. 


is 


Define  BAp  =  {A  G  A  :||  A  \\ip-ind<  !}•  From  Definition  C.3.1,  we  have  the 
following  theorem: 


Theorem  C.3.2  (Structured  small  gain  theorem)  The  feedback  connection  of 
M  and  A  ( Figure  C.3)  is  stable,  for  all  A  G  BA  ;)  if  and  only  if  SNAp(M)  <  1. 

Remark  C.3. 3  From  the  small  gain  theorem,  SNA:p(M)  <||  M  \\ip-ind- 

To  make  use  of  Theorem  C.3.2.  one  need  know  how  to  compute  SNAp{M) 
or  its  upper  bound.  Define  the  set  D  =  {D  :  D,D~L  are  /^-stable,  D~l  AD  G 
A,  and  ||  D~lAD  ||/p-md=||  A  \\tp-ind  for  all  A  G  A).  From  Definition  C.3.1, 
SNa.p(M)  =  SNAp(D~ 1 MD).  Then  by  the  small  gain  theorem,  we  have 

SNaJM)  <  mfD  II  I)  'MI)  \\lp_ind  .  (C.7) 

It  turns  out  that  equality  holds  in  (C.7)  for  many  important  cases. 

Theorem  C.3. 4  Let  A  =  {diag( Ai,A2,---  ,  An)  :  each  A,  is  a  nonlinear  or 
linear  time-varying  uncertainty  block  of  dimension  U  x  li}  1  <  i  <  n}.  Define 

D  =  {diag(d[  1^ ,  dMi2,  •  •  •  ,  dnJin )  :  cf  G  R,  cf  >  0,  1  <  %  <  n}, 

where  denotes  the  identity  matrix  of  dimension  f .  Then 

SNAoo(M)  =  inf  ||  D  'MD  h,SNA2(M)  =  inf  ||  I)  'MD  |U  .  (C.8) 

•  '  7  11  11  X  7  A i  T~\  11  11  x  7 
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Corollary  C.3.5  ||  M  ||  i  <  1  if  and  only  if  the  closed-loop  system  in  Figure  C.3 
is  loo-stable  for  arbitrary  nonlinear  or  LTV  A  with  ||  A  ||Joo_1-,ld<  1.  Similarly , 
II  M  ||oo<  1  if  and  only  if  the  closed-loop  system  is  l2- stable  for  arbitrary  nonlinear 
or  LTV  A  with  ||  A  \\h.ind<  1. 


For  the  case  of  FDLTI  uncertainties,  the  concept  of  structured  singular  value 
is  useful.  Let  A LTJ  =  {diag(Av,  A2,  •  •  •  ,  An)  :  A,  G  1  <  i  <  n}. 


Definition  C.3. 6  (Structured  singular  value  //)  Consider  the  feedback  connec¬ 
tion  in  Figure  C.3.  For  each  9  G  [0,2ir],  the  structured  singular  value 


Ta 


A 


1 


inf  a  e  a  LT7  { <^  max  [  A  (e-7  e )  ]  :  det{I  -  MA)(eje)  =  0} 
and  if  det(I  —  M A) (<■'")  0  for  all  A  G  A LTJ ,  then  ha [M{eJ$)]  is  defined  to  be  0. 


Theorem  C.3. 7  Let  l  =  max,  /,'.  Then 

SNAltiAM)=  sup  /.iA{M(eje)\,  (C.9) 

6e[0,27r] 

4=-  sup  fiA[M(eje)}  <  SNAlti,oo(M)  <  Vi  sup  fiA[M(eje)].  (C.10) 

VI  #e[0,27r]  0e[0,27r] 

For  single-input  single- output  (SISO)  blocks ,  equality  holds  in  (C.10). 


Consider  Figure  C.4.  From  Corollary  C.3.5,  the  robust  performance  problem 


(System  I)  can  be  converted  to  a  robust  stability  problem  by  adding  a  fictitious 


uncertainty  block  mapping  z  to  w  (System  II). 


Theorem  C.3. 8  Let  A  G  A.  Let  AP  be  the  class  of  arbitrary  nonlinear  or  LTV 
uncertainties.  Define  the  new  set  of  uncertainties 

A  =  {A  =  diag(A,  AP)  :  A  G  A,  AP  G  AP}. 

Denote  the  mapping  from  w  to  z  as  Tzw.  Then  for  p  =  oo  or  p  =  2,  System  I  is 
stable  and  ||  Tzw  || ip-ind<  1  for  all  A  G  BA  p  if  and  only  if  SN^  (M)  <  1. 
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System  I  System  II 


Figure  C.4:  Performance  robustness  vs.  stability  robustness. 

C.4  The  l\  Model  Matching  Problem 


We  have  seen  from  the  previous  section  that  solving  a  robust  control  problem 
involves  computation  of  inf  Den  II  D~lMD  \\ip~ind-  Recall  M  =  F)(G ,  I\),  therefore 
the  controller  synthesis  problem  is  often  posed  as: 


inf  inf 
stabilizing  k  DeD 


D~lMD 


(C.ll) 


Optimizing  (C.ll)  simultaneously  with  respect  to  K  and  D  is  hard.  One 
method  to  approximately  solve  (C.ll)  is  the  so  called  D-K  iteration  method.  The 
iteration  goes  as  follows: 


•  Step  1.  For  a  fixed  D  e  D,  solve 


inf  ||  D~lMD  \\ip-ind,  (C.12) 

stabilizing  i< 

and  denote  the  optimal  controller  as  K*. 

•  Step  2.  Fix  K*,  and  search  for  the  optimal  D*  to  minimize  ||  I)  Ml)  \\ip-ind- 

•  Step  3.  Go  back  to  Step  1  with  D  =  D*. 
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$ 


Figure  C.5:  The  model  matching  problem. 

By  redefining  G,  the  minimization  problem  in  Step  1  above  is  equivalent  to  a 
model  matching  problem  (recall  Eq.  (C.6)  and  Remark  C.2.4) 

inf  ||  E-UQV  \\lp_ind,  (C.13) 

q  stable 

as  illustrated  in  Figure  C.5. 

For  p  =  2.  the  problem  (C.13)  becomes  the  7 model  matching  problem  when 
we  consider  Q  G  TZ'Hoo  and  it  can  be  elegantly  solved  through  the  theory  of  Hankel 
operators.  Since  we  are  mainly  interested  in  the  case  p  =  oo  in  robust  control  of 
smart  actuators,  here  we  focus  on  how  to  solve  the  l\  model  matching  problem: 

inf  ||  E  -  UQV  II  ,  .  (C.14) 

Q<Eh 

C.4.1  Interpolation  conditions 

The  approach  to  solve  (C.14)  is  to  first  characterize  the  subspace 
S  =  {Reh:  R=  UQV  for  some  Q  G  h} 
and  then  solve  the  minimum  distance  problem 

inf  ||  E-  R  ||i  .  (C.15) 

ReS 

An  element  R  G  A  should  preserve  the  zero  structures  of  U  and  V:  intuitively 
speaking,  non-minimum  phase  zeros  (zeros  inside  the  open  unit  disk  V)  of  U  and 
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V  can  not  be  cancelled  by  poles  of  Q  since  Q  is  stable.  In  addition,  some  rank 
conditions  may  need  to  be  satisfied.  We  first  review  the  concepts  of  zeros  and  poles 
for  a  rational  matrix  G( A). 

Definition  C.4.1  A  square  polynomial  matrix  P( A)  is  called  unimodular  if  its 
determinant  is  a  nonzero  constant. 

By  definition,  unimodular  matrices  have  polynomial  inverses. 

Theorem  C.4.2  Let  G( X)  be  an  m  x  n  rational  matrix  of  normal  rank  r  (i.e.,  of 
rank  r  for  almost  all  X).  Then  G( A)  can  always  be  factored  as: 

G(  A)  =  LG(X)MG(X)RG(X ),  (C.16) 

where  LG( A)  and  Rg( A)  are  unimodular  matrices  of  the  appropriate  dimensions, 
and 


is  m  x  n ,  where  the  monic  polynomials  {c'?(A),  d’j(A)}  are  coprime  for  all  i  = 
1,2,-  ••  , r  and  have  the  following  divisibility  property:  q(A)  divides  e,+i(A)  and 
xjji+ i(A)  divides  r,  (A)  for  i  =  1, 2,  •  •  •  ,  r  —  1 . 

Definition  C.4.3  MG( A)  is  called  the  Smith- McMillan  form,  ofG(X).  The  roots 
of  I!!  i  q(A)  are  called  the  zeros  ofG(X)  and  the  roots  of  ]  Ji  ,  <.  y  ( A )  are  called  the 
poles  of  G( A). 
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Definition  C.4.4  Let  A0  be  a  zero  ofG(X).  Let  (JgX Ao)  denote  the  multiplicity  of 
A0  as  a  root  ofefiX) .  ogz(Xq)  is  known  as  the  algebraic  multiplicity  of  A0.  The  total 
number  of  indices  i  for  which  <JGi( Ao)  is  strictly  positive  is  known  as  the  geometric 
multiplicity  of  A0. 

Recall  Figure  C.2.  We  denote  the  dimensions  of  w,z,u,y  as  nw,nz,nu  and 
ny ,  respectively.  Without  loss  of  generality,  we  assume  that  U{ A)  has  full  column 
normal  rank  nu  and  V (A)  has  full  row  normal  rank  ny. 

Characterization  of  the  subspace  S  is  given  by  a  set  of  interpolation  conditions: 

Theorem  C.4.5  Let  the  Smith- McMillan  decompositions  of  U  and  V  be  U  = 
LijMijRu  and  V  =  LVMVRV,  respectively.  Let  A uv  denote  the  set  of  zeros  of  U 
or  V  in  the  closed  unit  disk  V.  Define  the  polynomial  row  and  column  vectors: 

A)  =  (Aj/'M A),  i=  1,2,- ••  ,nz 
A(A)  =  (RylY(X),  j  =  1,2,  ••  •  ,nw, 

where  ( M)t  denotes  the  i-th  row  of  matrix  M  and  (M)j  denotes  the  j-th  column  of 
M.  Assume  that  Auv  C  V.  Given  R  G  lfzXriw)  there  exists  a  Q  G  jfu/nv  such  that 
R  =  UQV  if  and  only  if  for  all  A0  G  Auv,  the  following  conditions  are  satisfied: 

1.  Zero  interpolation  conditions: 

*  =  !,•••  ,«u 

(aiRPj){k\ A0)  =  0  for  <  j  =  1,  •  •  -  ,  ny 

k  =  0,  •  •  •  ,  CTufiXo)  +  (Jy.  (Ao)  —  1 

2.  Rank  interpolation  conditions: 

{(»,/?)( A)  =  0  for  i  =  nu  +  1,  •  •  •  ,  nz 
(R0j)(X)  =  0  for  j  =  ny  +  1,  •  •  •  ,  nw 
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Corollary  C.4.6  The  result  in  Theorem  C-4-5  holds  for  Auv  C  'D  if  II  G 

From  Theorem  C.4.5,  if  nw  =  ny  and  nu  =  nz ,  the  rank  interpolation  conditions 
disappear  and  we  call  such  problems  one-block  problems.  A  problem  is  called  multi¬ 
block  if  it  is  not  one-block. 

C.4.2  The  one-block  problem 

The  zero  interpolation  conditions  in  Theorem  C.4.5  can  be  reformulated  in  terms  of 
mill  chains  of  U  and  V,  which  avoids  explicit  computation  of  the  Smith-McMillan 
decomposition.  We  assume  that  locations  of  zeros  are  known. 

Definition  C.4.7  (Null  chains)  Given  a  rn  x  n  real  rational  matrix  E( A)  ana¬ 
lytic  at  A0  and  a  positive  integer  o.  define  the  Toeplitz  matrix: 

(  E0  0  0  •••0 

~  E i  Eq  0  •  •  •  0 

T^(E)  = 

A  E  <J—1  Err  —  2  Ea— 3  •  •  •  Eq 

where  Ei  =  ^(^(Ao),  i  >  0.  A  right  null  chain  of  order  a  of  E{ A)  at  Xq  is  an 
ordered  set  of  column  vectors  in  Rn.  {.ri,x2,  •  •  •  ,  xa } ,  such  that  xg  ^  0  and 


A  left  null  chain  of  order  a  of  E( A)  at  A0  is,  by  definition,  a  right  null  chain  of 
order  a  of  ET  { A)  at  Xq. 
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Definition  C.4.8  A  canonical  set  of  right  null  chains  of  E( A)  at  A0  is  an  ordered 
set  of  right  null  chains,  i.e.,  x1  =  (x\,  •  •  •  ,xft)  for  i  =  1,  •  •  •  ,  l,  such  that 

1.  •  •  ,x\}  are  linearly  independent , 

2.  span{x\,-  ■  ■  ,  x\  }  —  the  nidi  space  of  E( A0),  and 

3.  C7 1  >  (72  >  •  •  •  >  (Jl . 

A  canonical  set  of  left  null  chains  is  defined  similarly. 


Definition  C.4.9  An  extended  set  of  right  null  chains  of  a  full  rank  n  x  n  rational 
matrix  E( A)  at  A0,  is  a  canonical  set  of  light  null  chains  augmented  with  n  —  l 
vectors  in  Rn,  {x\+l ,  •••  ,xf}  such  that  span{x\,x\,  -  ■  ■  ,xf}  =  Rn.  Each  {a?*}  is 
treated  as  a  chain  with  order  0,  I  +  1  <  k  <  n.  An  extended  set  of  left  null  chains 
is  defined  similarly. 


An  algorithm  to  compute  the  extended  set  of  null  chains  is  provided  in  [29], 
page  134. 

Given  an  element  of  an  extended  set  of  right  null  chains  at  A0,  x?  of  order  <jj, 
define 

^o(A)  ^  x{  +  (A  -  A0)4  4 - h  (A  -  Xop-% 

if  a j  >  0  and  -r*  (A)  =  x{  if  aj  —  0.  Similarly,  define  yfifi A)  for  an  element  of  an 
extended  set  of  left  null  chains,  if  ,  of  order  at.  With  this  notation,  we  have 


Theorem  C.4.10  Given  a  one-block  problem,  the  zero  interpolation  conditions  in 
Theorem  C.f.5  are  equivalent  to  the  following:  for  all  A0  £  A/  r- 


iO/ 


Ao 


iT^i0 


)(fc)(Ao) 


*  =  !,•••  ,nu 


0  for  < 


J 

k 


1,  •  •  •  ,riy 

0,  •  •  •  ,  ci/ii A0)  +  o’Vj(Xo) 


(C-17) 
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where  y'  and  xj  are  elements  of  the  extended  sets  of  left  and  right  null  chains  of 
U  and  V ,  respectively,  and  aUt,  aVj  are  the  corresponding  orders. 

The  problem  (C.15)  can  now  be  solved  by  a  linear  programming  approach. 
Recall  a  closed-loop  map  $  =  E  —  R.  Therefore  (C.17)  leads  to  a  set  of  linear 
equality  constraints  on  T: 

A  $  =  h 

where  Azero  is  some  linear  operator  on  b  G  M"c  and  nc  is  the  total  number 

of  constraints  imposed  by  (C.17). 

For  T  G  lfzXnw,  we  define  G  lfzXnw  as  follows: 

&fj[k]  =  max{0,  and  =  max{0,  —  Ty  [A:]}, 

for  all  k  >  0, 1  <  %  <  nz,  1  <  j  <  nw.  Thus  $  =  <h+  —  <F_.  ||  $  ||  i  can  be  expressed 
as  Ai,  (^+  +  T-)  where  Ai}  is  a  linear  functional. 

The  one-block  /  ,  model  match  problem  is  transformed  into  the  following  linear 
program: 

u°  =  inf  //,  such  that  (C.18) 

j/.'i1 '  .'i' 

Ae,0(d>+ -$-)  =  &, 

Ah  (<b+  +  $“)  <  v, 

>  0. 

When  Auv  C  V,  one  can  show  that  the  optimal  T  has  a  finite  impulse  response 
(FIR)  through  analysis  on  the  dual  linear  program  of  (C.18). 

From  the  optimal  T,  one  obtains  the  optimal  controller  by  plugging  the  corre¬ 
sponding  Q  into  (C.4). 
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C.4.3  The  multi-block  problem 


The  rank  interpolation  conditions  in  Theorem  C.4.5  also  imposes  linear  constraints 
on  $,  but  the  number  of  constraints  is  infinite.  Therefore  in  general  the  resulting 
linear  programming  problem  has  infinite  number  of  variables  and  infinite  number 
of  constraints.  Three  approximation  methods  are  available  to  solve  the  infinite 
dimensional  linear  programming  problem: 

1.  Finitely  Many  Variables  (FMV)  approximation:  approximate  $  by  a  finite 
impulse  response  of  length  N  which  results  in  finite  number  of  variables. 

2.  Finitely  Many  Equations  (FME)  approximation:  approximate  the  dual  vari¬ 
ables  by  a  finite  vector  of  dimension  N  which  is  equivalent  to  retaining  finite 
number  of  constraints  in  the  primal  problem. 

3.  Delay  Augmentation  (DA)  approximation:  Embedding  the  problem  into  a 
one-block  problem  by  augmenting  U  and  V  with  N  pure  delays. 


The  DA  approximation  carries  richer  information  about  the  structure  of  the 
optimal  solution  than  FMV  and  FME.  It  is  also  the  method  we  use  in  our  compu¬ 
tation.  We  now  give  a  brief  introduction  to  the  DA  method. 

We  first  partition  the  original  system  (C.6)  as: 


$11  $12 

y  d>2l  $22 


'  En  E12 
E21  E22 


cm  v2), 
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where  U\  G  /""  ‘ "  and  Vj  G  l™yXUy .  Denote  by  SN  the  Ar-th  order  delay  operator, 
i.e.,  Sjv  =  AjV.  Augment  U ,  V  and  Q  accordingly: 


/  \ 

(  \ 

$  11,1V  $  12,1V 

A 

En  Eu 

y  $21, N  $22, IV  J 

y  e21  e22 

<*>  Tv- 


Theorem  C.4.11  For  N  >  0,  let 


u°  =  inf  ||  A  -  UQV  ||  i,  and 


7/  =  $°v  =  inf  ||  $w  ||i 

-A  Qrteqz<nw 


If  $°N  is  achievable,  let  Q%  be  the  corresponding  minimizes.  Let  t]n 
UQ°nV  || .  Then 


E  - 


VN  <  v  <  Vn- 


(C.19) 


Proof  We  have 


v 


o 


inf  ||  $v  ||:i>  ??»,• 

Qu  €/""  X"iyQl2=Q21=Q22=0 


Sine  Qii  ^  /|,“Xnw,  it’s  clear  that  z/°  <  r/.y.  ■ 

Theorem  C.4.11  tells  us  that  we  can  obtain  a  sub-optimal  controller  by  plugging 
Qu  into  (C.4). 

Convergence  results  for  the  DA  method  can  be  found  in  [29] . 
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